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This  report  describes  the  work  performed  by  Scientific  Systems,  Inc.  from  10/15/88 
to  4/30/88  under  the  Navy  Award  No.  N00014-88-C-0736.  The  work  represents  the  first 
of  a  three-phase  project  aimed  at  development  of  theory,  computational  techniques  and  a 
commercial  grade  software  for  solving  decentralized  estimation  problems  in  the  presence  of 
correlated  sensor  noise.  Solutions  are  available  for  the  correlated  sensor  noise  only  In 
this  report,  the  asailable  results  have  been  extended  in  many  directions  and  the  distributed 
tracking  problem  has  been  solved  with  correlated  sensor  measurement  noise  The  objective 
of  Phase  I  effort  was  to  demonstrate  the  feasibility  of  a  solution  techmque  of  this  problem. 
The  problem  scenario  will  be  broadened  and  the  results  will  be  analyzed  in  detail  in  Phase 
II. 
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SUMMARY 


The  research  proposed  under  this  project  is  concerned  with  the  problem  of  accurate 
target  tracking  in  the  SD!  environment.  The  SDl  scenario  considered  in  this  report  is  as 
follows.  Mulnple  sensors  that  are  possibly  located  at  dispersed  geographical  locations  are 
observing  a  single  target.  The  measurement  noises  are  correlated  across  the  sensors.  The 
local  processors  attached  with  the  sensors  possibly  may  not  have  the  complete  information 
of  the  target  model  The  tracking  problem  can  be  solved  in  two  ways'  first,  the  sensor 
measurements  can  be  transmitted  to  a  central  node  where  the  optimal  state  can  be  esnmated 
conditional  on  all  the  measurement  history  -  this  is  centralized  processing  Alternatively, 
each  sensor  can  process  its  own  measurement  locally  and  send  (some  function  of) 
processed  data  to  the  central  node  Here  it  is  fused  with  other  incormng  data  into  the  global 
esttmate  -  this  is  decentralized  processing.  The  latter  is  superior  to  the  centralized 
processing  from  the  system  performance  considerations  in  the  face  of  node  failures 

There  exists  a  good  amount  of  literature  in  the  area  of  decentralized  estimation  in 
presence  of  uncotrelated  sensor  noise.  In  this  work,  the  available  results  have  been 
extended  to  the  case  of  correlated  sensor  noise  under  the  assumption  that  the  local 
processors  have  the  knowledge  of  the  target  model.  Each  local  agent  needs  to  send  the 
central  node  only  one  vector  of  the  dimension  of  the  state.  The  available  results  for  the 
uncorrelated  sensor  noise  have  been  extended  in  other  directions  too.  For  example,  it  has 
been  established  that  there  is  a  tradeoff  between  the  computational  requirements  at  a  node 
and  the  transmission  requirements  from  this  node  Using  this  analysis  and  by  assigning 
the  major  Computational  load  to  the  local  nodes,  the  complexity  of  the  central  node  has  been 
reduced  to  an  adder. 

The  decentralized  estimation  problem  has  been  analyzed  both  in  discrete  time  and  in 
continuous  time  domain.  It  has  been  proposed  that  the  nonlinear  measurement  equations 


V 


can  be  dealt  with  by  a  recent  technique  know'n  as  “Modined  Cain  Extended  Kalman  Filter 
(MGEKF)  more  elEcienily  than  the  conventional  Extended  Kalman  Filter  (EKF) 
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CHAPTER  1 


INTRODUCTION  AND  SUMMARY 

1 . 1  Introduction  and  Overall  Objective  of  Phase  I 

This  report  contains  results  from  the  investigations  of  the  project  "Opnmal  Sensor 
Configuration  and  Survivable  Processing  with  Correlated  Noise."  This  is  a  Phase  I  effort 
sponsored  by  the  Naval  Research  Center  of  Arlington,  Viiginia  through  Small  Business 
Innovative  Research  (SBIR)  Program  under  contract  #N00014-88-C-0736.  This  job  was 
sponsored  by  the  Navy  through  the  Strategic  Defense  Initiative  Organization  (SDIO).  This 
work  was  performed  at  Scientific  Systems,  Inc.  (SSI),  Cambridge,  Massachusetts  during 
the  period  October,  1988  -  Apr!,  1989. 

The  surveillance,  acquisiuon  and  uacking  function  for  the  Strategic  Defense  Ininative 
(SDI)  includes  sensing  information  .‘or  battle  management  and  processing  signals  and  data 
for  discnmination  of  threatening  reentry  vehicles  from  other  objects.  As  each  potennal 
reentry  vehicle  is  released  from  its  post-boost  vehicle,  it  begins  ballistic  midcourse  flight 
accompanied  by  deployment  hardware  and  possibly  by  decoys.  Each  credible  object  must 
be  accounted  for  in  a  birth-to-death  track,  even  if  the  price  is  many  decoy  false  alarms. 
Interceptor  vehicles  of  the  defense  must  also  be  tracked. 

The  research  proposed  under  this  project  is  concerned  with  the  problem  of  accurate 
target  tracking  in  the  SDI  environment.  The  problem  of  tracking  boosters  and  reentry 
vehicles  has  received  much  attention  in  the  past.  However,  the  problem  has  generally  been 
structured  as  a  single-sensor,  single-target  problem.  The  SDI  environment  is  much  more 
comple.\,  however,  in  that  many  targets  must  be  uacked  simultaneously.  In  addition,  many 
sensors  may  be  used  to  track  a  single  target.  Although  the  use  of  multiple  sensors 
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intuitively  allows  more  accurate  and  robust  tracking  to  be  attained,  it  is  not  yet  clear  how  to 
fully  exploit  this  additional  capability.  In  addition,  the  SDI  environment  is  such  that  the 
sensor  observations  are  subject  to  noises  which  are  correlated  across  sensors  This  further 
complicates  the  problem. 

Ideally,  the  problem  of  target  tracking  in  a  multi-sensor  environment  with  limited 
communication  between  the  sensors  is  a  distributed  estimation  and  detection  problem  which 
uses  no  centralized  signal  processing.  The  information  available  to  each  sensor  consists  of 
Its  ow'n  measurement  history  and  possibly  all  past  messages  received  from  other  sensors 
In  this  report  we  shall  refer  to  this  problem  also  as  a  distributed  tracking  problem.  The 
overall  objective  of  this  3-phase  project  is  to  obtain  a  practical  solution  of  the  distributed 
tracking  algonthm  for  multiple  targets  under  the  assumption  that  the  measurement  noises 
are  conelated  across  the  sensors.  This  report  contains  Phase  I  activities  and  indicates  the 
feasibility  of  the  proposed  technique  for  solving  this  problem.  Detailed  development  of  the 
technique  will  be  earned  out  in  the  forthcoming  Phase  n  and  finally,  these  ideas  will  be 
commercialized  in  Phase  III. 

The  overall  problem  of  distributed  tracking  is  indeed  a  very  difficult  and  complex 
one  This  problem  can  be  decomposed  into  vanous  subproblems  that  can  be  arranged  into 
hierarchical  layers  according  to  their  complexity  as  shown  in  Figure  1.1.  Obviously,  the 
starting  point  is  the  centralized  processing  where  raw  measurements  from  all  the  sensors 
are  transmitted  to  a  common  fusion  center  for  simultaneous  processing  -  this  is  the  lowest 
layer  of  the  hierarchy. 
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Figure  1.1;  Decomposition  of  the  Target  Tracking  Problem  in  an  SDI 
Framework 


One  of  the  senous  criticisms  of  this  technique  is  that  the  system  is  vulnerable  to 
catastrophic  failure  if  the  central  node  fails.  On  the  other  hand,  in  the  case  of  distributed 
processing,  the  observed  data  from  each  sensor  is  processed  locally.  The  processed  data 
are  then  communicated  to  a  centralized  coordinator  who  forms  the  global  estimate.  In  this 
case,  the  system  performance  will  have  a  graceful  degradation  in  the  event  of  sensor/node 
failure.  This  is  the  pnmary  motivation  for  opting  for  the  distributed  processing  over  the 
centralized  one. 


Vanous  components  of  the  distributed  tracking  problem  ate  shown  in  Figure  1.1. 
This  figure  indicates  a  systematic  approach  to  the  overall  solution  of  the  problem.  Any 
subset  of  the  blocks  of  this  figure  which  are  connected  by  arrows  can  be  combined  to  form 
a  meaningful  tracking  problem.  The  core  of  the  distributed  tracking  problem  is  the  case  of 
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one-target,  multiple  senior  with  distnbuted  processing"  and  is  shown  at  the  second  layer 
of  Figure  1  1  In  this  scenario,  time  delays  due  to  computation  and  communication 
processes  are  neglected  It  is  assumed  that  the  sensors  are  communicating  with  each  other 
instantaneousK  in  a  broadcast  mode  Although  this  is  a  hi'pcthencal  setting,  the  analysis  of 
this  case  exhibits  the  structure  of  the  decentralized  processing  problem.  In  a  realistic 
situation  the  measurements  from  various  sensors  will  be  correlated  primarily  because  the 
sensors  are  observing  the  same  target  through  the  same  atmospheric  medium.  The 
decentralized  estimation  problem  then  becomes  considerably  difficult  if  the  correlation  is 
taken  into  account.  The  measurement  equations  which  are  nonlinear  in  states  also  adds 
further  complication  to  this  problem.  Once  the  distributed  tracking  problem  is  solved  for 
correlated  noise  and  nonlinear  measurement  equations,  the  effect  of  a  realistic 
communication  channel  can  then  be  considered  In  an  SDl  framework,  direct 
communication  takes  place  only  between  adjacent  satellites  arranged  in  a  nng  structure 
about  the  Earth  Durtng  thts  phase  of  analysis,  time  delays  due  to  computation  and 
communication  processes  must  be  taken  into  account  if  the  algorithm  is  intended  for 
practical  application  Finally,  the  scenario  for  multiple  targets  can  be  analyzed  as  an 
extension  of  the  one-target  case. 

Because  of  the  time  limitations  in  Phase  1,  we  could  not  solve  all  subproblems  of 
Figure  1  1  We  have  explored  only  the  basic  problem,  the  case  of  correlated  measurement 
noise  and  nonlinear  measurement  equations  as  shown  in  layers  2  and  3  of  our 
decomposition 

1.2  A  Simplified  Generic  SDI  Tracking  Problem 

Let  us  consider  an  example  of  a  simple  SDI  tracking  problem  with  one  target  only  - 
this  scenario  will  be  broadened  to  multiple  targets  in  Phase  II.  Formulation  of  the 
decentralized  estimation  problem  will  be  monvated  through  this  example.  Consider  then  a 
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generii.  SDI  tracking  problem  with  one  target  in  a  ballistic  trajectory  and  three  satellite 
sensors  as  shown  in  Figure  1.2.  The  sensors  are  assumed  to  make  time-synchronized 
measurements  The  measurements  are  taken  through  one  or  more  of  the  laser,  infrared  and 
radar  sensors  that  are  located  at  appropnate  places  of  die  orbiting  satellite  stations. 


Figure  1.2:  A  Simplined  SDI  Tracking  Problem 

Both  active  and  passive  sensors  will  be  considered  under  the  scope  of  the  present 
task  An  active  sensor  such  as  a  radar  measures  the  range  (R),  azimuth  angle  (A)  and 
elevation  angle  (E)  of  the  target  in  a  reference  frame  that  is  centered  at  the  radar.  On  the 
other  hand,  a  passive  sensor  such  as  an  infrared  sensor  produces  "bearings  only" 
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measurements  The  SDI  tracking  problem  can  be  formulated  in  the  "measurement 
coordinate  system"  where  the  variables  are  R,  A,  E  and  their  derivatives  or  in  the  "cartesian 
coordinate  systems'  in  which  the  variables  are  x,  y,  z-position  of  the  target  and  their 
denvatives  In  the  latter  case,  an  inertial  reference  frame  is  used. 

Selecting  a  Coordinate  System 

It  can  be  shown  that  in  the  Cartesian  coordinate  system,  the  dynamical  equation  for 
the  target  motion  is  linear  whereas  the  measurement  equation  is  non-linear  in  states.  On  the 
other  hand,  in  the  measurement  coordinate  system,  the  dynamical  equation  is  non-linear 
whereas  the  measurement  equation  is  linear  in  state  variable.  Therefore  in  both  the 
coordinate  systems,  an  extended  Kalman  filter  may  be  used  for  state  estimation.  It  is  well 
known  that  an  extended  Kalman  filter  has  a  bias  in  its  estimates  [Jazwinski,  1970].  A 
detailed  study  on  this  issue  was  made  by  Mehra  [1971].  He  observed  that  this  bias  is 
related  to  the  non-lineanties  in  the  equations  of  motion  and  the  measurement  system  By 
choosing  different  coordinate  systems,  one  can  alter  these  non-lineanties.  In  particular,  if 
one  uses  tne  coordinate  system  in  which  the  measurements  are  linear,  the  bias  due  to 
measurement  non-lineanty  is  eliminated  This  does  not  necessarily  mean  that  the  total  bias 
IS  reduced  since  the  bias  from  other  non-lineanties  might  increase  Finally,  he  has  shown 
that  an  extended  Kalman  filter  that  uses  the  measurement  coordinate  system  has  less  bias 
and  less  rms  error  than  a  Canesian  extended  Kalman  filter  that  uses  the  Canesian 
coordinate  system  This  fact  cenainly  favors  the  use  of  the  range  (R),  azimuth  (A),  and 
elevation  (E)  angle  vanables  in  the  SDI  tracking  problem  formulation  But  Song  and 
Speyer  (1984)  has  shown  that  for  a  class  of  non-Iinear  measurement  equations  known  as 
modifiable  functions."  the  structure  of  the  optimal  filter  is  linear.  They  have  called  this 
filter  a  "Modified  Gain  Extended  Kalman  Filtei"  (MGEKF).  Fortunately,  many  "bearings 
only "  measurement  equations  belong  to  the  class  of  "modifiable  functions"  and  we  have 
found  that  we  can  profitably  use  this  concept  to  solve  the  distributed  tracking  algorithm  A 


linear  filter  is  certainly  more  attractive  than  a  nonlinear  filter.  Motivated  by  these 
observations,  we  will  use  the  Cartesian  coordinate  system  throughout  this  project.  In  this 
coordinate  system,  the  dynamic  equation  is  linear  whereas  the  measurement  equation  is 
nonlinear  in  states 

1.3  Summary  of  Phase  I  Activities 

The  problem  of  distnbuted  detection  and  estimation  has  lately  received  considerable 
attention  in  the  literature  Therefore,  as  a  first  task  in  Phase  I,  relevant  publications  were 
surveyed  The  useful  results  from  these  papers  were  collated  and  several 
modifications/exter  ions  were  m,.de  that  are  appropriate  for  the  problem  at  hand  Some  of 
the  related  works  on  which  the  proposed  technique  heavily  relies  are: 

•  Speyer  (1979)  considered  the  discrete-time  distributed  estimation  problem 
assuming  that  all  sensors  communicated  information  to  all  other  sensors  in  the 
network  tn  a  broadcast  mode.  By  formulating  a  linear  Gaussian  measurement 
problem  with  a  Gauss-Markov  state  equation,  he  was  able  to  show  that  the 
optimum  estimate  at  each  sensor  could  be  obtained  using  its  own  Kalman  filter. 
Each  sensor  sends  its  local  stale  estimate  to  each  other  sensor  at  each  time  step 
which  then  constructs  the  globally  optimum  state  The  expense  of  distributed 
estimation  is  that  an  additional  data-dependent  vector  of  dimension  equal  to  the 
state  vector  dimension  has  to  be  communicated  from  each  sensor  to  every  other 
sensor  at  each  time  step.  This  solution  is  attractive  since  each  sensor  computes 
the  optimal  (minimum  variance)  state  estimate  and  only  a  relatively  small 
amount  of  communication  is  required  between  sensors. 

•  Willsky  et.  al.  (1982)  considered  a  more  general  case  than  Speyer  (1979)  and 
formulated  the  problem  in  continuous  time  domain  They  considered  the  case 
where  the  local  models  assumed  by  various  sensors  are  possibly  different  than 


the  true  global  model  They  also  solved  the  distributed  smoothing  problem. 
Their  results  also  show  that  each  node  must  send  2  vectors  -  us  state  estimate 
and  a  data-dependent  vector  to  the  central  coordinator  which  constructs  the 
globally  optimum  estimate. 

•  Castanon  and  Teneketzis  (1985)  have  developed  algorithms  for  distributed 
nonlinear  estimation  and  showed  that  if  each  sensor  sends  a  set  of  local 
sufficient  statistics  to  each  other  sensor,  each  sensor  can  construct  the  global 
centralized  distribution.  These  results  assume  that  all  sensors  have  the  same  a 
priori  knowledge  of  system  uncertainties,  that  all  (two-way)  communication 
links  are  unfailed,  and  that  all  communication  channels  are  memorylcss  and 
independent 

But  the  problem  with  coirelated  sensor  noise  was  not  addressed  in  these  literature. 

In  Phase  I,  we  have  extended  the  results  of  Willsky  et.  al.  (1982)  to  show  that  only 
one  vector,  instead  of  two,  need  to  be  sent  to  the  global  coordinator.  We  have  also 
developed  an  expression  that  shows  the  effect  of  initial  uncertainty  of  various  sensors. 
Also,  we  have  extended  the  result  of  Speyer  (1979)  to  include  the  case  where  the  local 
models  are  not  necessarily  idenncal  to  the  global  (true)  model. 

Three  schemes  have  been  developed  for  solving  the  problem  with  correlated  sensor 
noise.  The  first  two  of  these  are  straightforward  application  of  the  Kalman  filter  at  a  central 
node.  In  the  first  scheme,  all  the  sensor  data  are  transmitted  to  a  central  node  and  a  Kalman 
filter  IS  applied  to  the  aggregated  data.  The  second  scheme  also  consists  of  the  centrahzed 
processing  but  before  the  Kalman  filter  is  applied,  the  sensor  measurements  are 
transformed  into  new  measurements  that  are  uncorrelated  across  the  sensors.  Clearly,  there 
IS  no  advantage  of  the  second  scheme  over  the  first.  However,  the  third  scheme  is  truly  a 
decentralized  one  where  the  observed  data  is  processed  locally  and  then  the  processed  data 
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IS  sent  to  the  central  node  where  it  is  fused  with  the  other  incoming  data  into  the  globally 
optimal  estimate. 

We  have  fond  that  nonlinear  measurement  equations  can  be  dealt  with  in  the 
dtstnbuted  tracking  problem  by  utilizing  the  "modifiable  fun  tion"  concept  of  Song  and 
Spe^  '1984;.  In  this  phase  we  have  successfully  applied  this  concept  to  the  problem  of 
"beanngs  only"  measurements.  In  Phase  11,  this  technique  will  be  extended  to  the  case  of 
distributed  processing 

The  remainder  of  this  report  is  organized  as  follows:  In  Chapter  2,  the  distributed 
processing  problem  has  been  described  in  continuous  time  domain  along  the  line  of  Willsky 
et.  al  (1982)  and  their  results  have  been  extended.  In  Chapter  3,  the  discrete  time  version 
of  the  tracking  problem  has  been  presented  where  the  results  of  Speyer  (1979)  are  extended 
to  include  the  case  of  possibly  dissimilar  global  and  local  models.  It  will  be  assumed  in 
Chapter  2  and  3  that  the  sensors  are  uncorrelated.  The  case  of  correlated  sensor  noise  will 
be  dealt  with  in  Chapter  4  where  three  techniques  will  be  proposed  for  solving  this 
problem  The  case  of  nonlinear  measurement  equations  will  be  dealt  with  in  Chapter  5. 
Finally,  conclusions  and  future  recommendations  wili  be  given  in  Chapter  6. 
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CHAPTER  2 


DECENTRALIZED  ESTIMATION:  CONTINUOUS  TIME  CASE 

2.1  Introduction 

In  an  SDI  environment  a  target  is  tracked  by  many  sensors  placed  in  orbiting  satellites 
at  desired  altitudes.  These  satellites  are  linked  via  communication  channels.  The  goal  is  to 
generate  the  optimal  estimate  of  the  state  ffajectory  of  the  target  from  these  observations. 
Clearly,  the  simplest  scheme  by  which  it  can  be  done  is  to  transmit  at  each  instant  of  time 
all  the  sensor  data  to  a  central  coordinator  where  the  optimal  trajectory  can  be  computed 
conditional  on  the  measurement  history.  Although  this  is  a  simple  scheme,  it  suffers  from 
many  drawbacks.  For  example,  the  communication  introduces  a  finite  delay  in  the 
observed  data  and,  in  addition,  adds  to  it  noise  from  the  communication  channel.  It  may  be 
recalled  that  the  observed  data  was,  to  begin  with,  already  corrupted  from  the  sensor  noise. 
The  most  serious  criticism  of  this  scheme  is  that  all  the  computations  are  done  at  a  central 
node  and  in  the  event  of  a  failure  (soft  or  hard)  of  this  node,  catastrophic  failure  of  the 
system  performance  takes  place  This  can  be  avoided  if  the  sensor  data  is  processed  locally 
to  obtain,  say,  locally  optimum  state  esumate  conditional  on  the  observed  data  oy  that 
sensor  only  and  then  these  estimates  are  sent  to  the  centralized  coordinator  where  the 
globally  optimum  estimate  will  be  reconstructed  This  scheme  is  known  as  '  decentralized" 
or  "distributed  "  estimation.  In  this  scheme,  the  system  performance  will  suffer  graceful 
degradation  in  the  event  of  a  sensor  failure.  Moreover,  the  computational  burden  is  shared 
by  all  the  sensors  without  imposing  a  heavy  workload  on  the  centralized  coordinator.  In 
order  to  develop  redundancy  into  the  system,  each  sensor  should  ideally  act  as  a  centralized 
coordinator  in  the  sense  that  each  sensor  will  receive  the  estimate  from  the  other  sensors 
and  generate  idenncal  global  estimates. 
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A  difficult  pan  of  the  decentralized  scheme,  however,  is  the  development  of  a  fusion 
scheme  which  combines  in  an  appropnate  fashion  all  the  local  estimates  into  a  global 
estimate.  The  pnmary  purpose  of  this  chapter  is  to  formulate  this  problem  for  the 
continuous  time  case,  review  the  existing  results  and  finally  extend  these  results 
appropriately. 

A  practical  combining  (fusion)  algonthm  must  take  into  account  the  communication 
delay  and  the  channel  noise,  but  it  will  make  the  analysis  too  complicated.  Therefore  in 
Phase  I,  we  have  considered  an  ideal  situation  where  the  communication  delay  and  channel 
noise  have  been  neglected  These  issues  will  be  dealt  with  in  Phase  II  In  this  chapter  we 
have  emphasized  the  development  of  a  mixing  algorithm  under  the  assumption  that  all 
sensors  are  in  a  broadcast  mode  and  receive  information  from  the  other  sensors 
instantaneously.  We  have  pnmanly  followed  the  works  of  Willsky  et  al.  (1982)  in  the 
development  of  this  algorithm. 

A  summary  of  this  chapter  is  as  follows  In  Section  2.2,  the  decentralized  estimation 
scheme  is  developed  under  the  above  simplifying  assumptions.  Wiilsky  et  al.  (1982)  has 
shown  that  in  order  to  obtain  the  globally  optimum  state  estimate,  each  node  (sensor)  must 
send  to  the  central  coordinator  its  own  state  estimate.  But  the  structure  of  the  local  and 
central  nodes  is  complicated.  We  have  extended  this  result  to  simplify  the  computational 
complexity.  In  this  section  we  have  further  assumed  that  the  global  mode!  is  not 
necessarily  the  same  as  the  local  model  each  sensor  (agent)  has  possibly  a  different  model 
about  the  target.  The  global  model  is  the  true  model  of  the  target.  This  assumption  is 
relaxed  in  Secuon  2.3  where  we  have  assumed  that  the  global  model  is  the  same  as  the  local 
model.  This  case  was  onginally  analyzed  by  Speyer  (1979)  It  is  shown  in  this  case  that 
the  globally  optimal  estimate  is  a  linear  combinanon  of  the  local  estimate  plus  a  dynamic 
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correction  that  results  because  of  the  distributed  nature  of  the  algorithm.  Finally,  some 
conclusions  are  drawn  in  Section  2.4 

2.2  Decentralized  Estimation  and  Transmission  Requirements 

As  mentioned  before,  the  problem  of  decentralized  estimation  will  be  formulated 
under  the  following  assumpnons: 

•  The  dynamics  are  evolving  in  continuous  time;  the  discrete  time  case  will  be 
analyzed  in  the  next  chapter. 

•  Only  one  target  is  to  be  tracked. 

•  The  target  dynamics  are  linear,  but  may  possibly  be  time  varying. 

•  The  measurement  equations  are  linear  in  states;  the  nonlinear  equations  will  be 
treated  in  Chapter  5,  using  "modifiable  function"  concept. 

•  There  is  no  communicanon  delay 

•  There  is  no  channel  noise. 

•  The  nodes  arc  in  a  broadcast  node,  i.e.  each  node  can  communicate  with  all  the 
other  nodes.  The  ring  structure  of  a  realistic  sensor  configuration  will  be  dealt 
with  in  Phase  II. 

•  The  sensors  are  uncorrelated  from  each  other;  the  case  of  correlated  sensors  will 
be  analyzed  in  Chapter  4. 

First,  let  us  fix  the  terminologies  that  will  be  used  throughout  this  report.  We  assume 
a  scenano  where  multiple  sensors  are  observing  an  SDl  target.  We  also  assume  that  these 
sensors  are  equipped  with  the  data  processing  capabilities.  The  sensors  are  also  known  as 
nodes,  local  agents  or  local  processors.  The  target  state  model  is  the  true  model  and  its 
state  estimate  is  to  be  computed  from  the  sensor  measurements.  The  true  model  is  also 
known  as  the  central  or  global  model.  In  this  section  we  consider  a  general  case:  we 


fusion  center,  also  known  as  a  mixing  mode  where  all  the  local  estimates  will  be  combined 
to  obtain  the  global  estimate.  Since  all  the  sensors  are  in  a  broadcast  mode,  each  sensor 
receives  raw  observations  or  processed  data  from  the  other  sensors.  We  assume  here  that 
each  sensor  acts  as  a  fusion  center  and  combines  the  data  from  the  other  sensors  into  a 
global  estimate.  Since  each  sensor  is  producing  identical  global  estimates,  this  scheme 
introduces  enough  redundancy  into  the  system  and  makes  it  robust  against  sensor  failure. 

Suppose  that  there  are  J  sensors.  The  global  and  the  local  models  are  given  in  the 
following. 


Global  Model 

x(t)  =  A(t)x(t)  +  w(t) ;  tSto;  ^2.1  a) 

Zj(t)  =  Cj(t)x(t)  +  Vj(t) ;  j  =  1 ,  2,  ...J  ^2. 1  b) 

Various  statistics  relating  to  the  model  are: 

E  [x(to)]  =  x(to) ;  E  [(x(to)  -  x(to))  (x(to)  -  x(to))^)  =  X(to) ; 

E  [w(t)]  =  0 ;  E  [w(t)w^(t)]  =  Q(t)8(t  -  x) ; 

E  [vj(t)]  =  0 ,  E  [v/t)vi(x))  =  R/t)5(t  -  T)5jkRj(t)  =  R]'(t)  >  0  . 

We  assume  that  w(t)  is  independent  of  x(to)  and  of  Vj(t)  for  all  j  =  1,  2,  ...J.  It  is  also 
assumed  that  Vj(t)  is  independent  of  x(to). 

Although  the  true  model  is  given  by  (2.1),  the  sensors  may  not  have  the  complete 
knowledge  of  it  Suppose  the  j-th  sensor  is  assuming  the  following  model  about  the  target: 
this  model  is  also  known  as  the  local  model. 


Local  Models 

Xj(t)  =  Aj(t)Xj(t)  +  Wj(t) 


(2.2a) 


13 


Ej(t)  =  Hj(t)Xj(t)  +  Vj(t) ;  j  =  1, 2,  ...J 


(2.2b) 


E  [Xj(to)]  =  Xj(to) E  [(Xj(to)  -  Xj(to))  (x/to)  -  Xj(to))^]  =  E/to) ; 

E  [wj(t)]  =  0 ;  E  [Wj(t)wJ(T)l  =  Qj(t)5{t  -  x) ; 

E  (v/t)]  =  0 ;  E  [ Vj(t)v^(x)]  =  R/t)5(t  -  x)5jk :  R/O  =  R]'(t)  >  0  . 

As  before,  we  assume  that  Vj(t)  is  independent  of  Xj(to)  and  of  Wj(t)  for  all  j  =  1,  2,  ...J. 
Notice  that  the  measurement  noise  process  has  been  assumed  to  be  the  same  for  both  global 
and  local  models  (2.1)  and  (2.2).  This  is  a  plausible  assumption,  because  whatever  the 
underlying  model  is,  the  sensor  is  always  the  same  and  therefore  the  sensor  noise  ought  to 
be  the  same  too.  However,  m  order  to  make  the  problem  well  posed  we  need  to  impose  the 
following  additional  relationship  between  the  local  and  global  models: 

C/t)  =  H,(t)M/t);  j=l,2,...J  (2  3) 

where  Mj(t)  is  a  time  varying  matrix  of  appropriate  dimension.  Besides  the  requirement  of 
(2.3)  and  that  the  sensor  noise  Vj(t)  are  the  same,  the  two  models  can  be  totally  arbitrary. 


General  Solution  Without  Decentralized  Considerations 


The  globally  optimal  estimate  x(t)  using  all  the  information  is  given  by 

x(t)  =  E[x(t)  I  z,(^),  Z2(^),  ...Zj(^) tflS^St]  (2  4^) 

where  E(*]  is  the  expectation  operator  and  can  be  implemented  at  a  central  node  if  all  the 
sensor  data  Zj(t),  j  =  1 ,  2,  ...J  are  sent  to  this  node.  In  order  to  evaluate  x(t)  first  define 


A 

z(t)  = 


'zi(t)‘ 

■c,(t)‘ 

’vi(t)' 

Z2(t) 

A 

;  C(t)  = 

C2(t) 

A 

;  v(t)  = 

V2(t) 

Zj(t) 

CKO 

Vj(t)_ 

(2.4b) 
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then  all  the  sensor  data  can  be  compactly  represented  as 

z(t)  =  C(t)x(t)  +  v(t) .  (2,4c) 

The  opnm  J  estimate  x(t)  conditional  upon  the  measurement  history  is  then 
i(t)  =  E[x(t)  I  z®.  to<^St] 
which  IS  propagated  as 

x(t)  =  A(t)x(t)  +  K‘(t)D‘(t)  f  (2.5a) 

u‘(t)  =  z(t)  -  C(t)x(t) ,  (2.5b) 

K'(t):-P(t)c’^(t)R''(t).  (2.5c) 

Here  K'(t)  is  the  centralized  Kalman  gain.  The  covariance  matrix  P(t)  is  the  solution  of  the 
Riccati  equation 

P(t)  =  A(t)P(t)  +  P(t)A'^(t)  +  Q(t)  -  P(t)C’^(t)R-‘(t)C(t)P(t) .  (2.5d) 

P(to)  =  I(to). 

R(t)  is  the  covariance  matrix  associated  with  the  sensor  noise  v(t),  i.e. 

E[v(t)v^(T)]  =  R(t)5(t-x)  (2.6a) 

The  complexity  of  the  decentralized  algorithm  depends  upon  the  structure  of  the  R(t) 
matnx.  Since  we  are  considering  the  case  of  uncorrelated  sensor  noise,  various  subvectors 
of  vCtl  of  (2.4b)  are  uncorrelated.  Therefore 

R(t)  =  block  diag  {Rj(t)}  .  (2.6b) 


15 


R(t)  will  be  a  ''full"  matrix  if  the  sensors  are  correlated.  However  using  the  block  strjctuie 
of  R(t)  matnx  the  centralized  Kalman  filter  of  Equation  (2.5)  for  the  uncorrelated  sensor 
noise  can  be  written  as: 

J 

j(t)  =  A(t)x(t)  +  XKj(t)t);(t), 


J=  1 

(2.7a) 

Oj  (t)  =  Zj(t)  -  Cj(t)x(t)  , 

(2.7b) 

K'(t)  =  P(t)cJ'(t)R3"'(t), 

J 

P(t)  =  A(t)P(t)  +  P(t)A’^(t)  +  Q(t) -  ]^(t)c]'(t)R;'(t)Cj(t)P(t) . 

J=* 

(2.7c) 

(2.7d) 

Equation  (2,7a)  can  further  be  simplified  to 

J 

i(t)=F(t)i;(t)+5^Kj(t)z/t) 

j=i 

(2.8a) 

where 

J 

F(t)  =  A(t)  -  XP(')c]'(t)R;'(t)C/t) . 

J=1 

(2.8b) 

An  implementation  of  Equation  (2.8)  is  shown  in  the  block  diagram  in  Figure  2  1.  Notice 
that  the  gam  matrices  K'(t),  j  =  1,2,  ...J  arc  precomputable  and  can  be  stored  aption. 
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Decentralized  Estimation  Using  Local  Data  Processing 

Now  consider  the  local  models  of  Equation  (2.2).  The  optimal  estimate  x/t)  at  the  j- 
th  node  (sensor)  is  construct  rd  only  from  the  local  observation  Zj(t)  from  that  sensor. 
Tiierefore 

Xj(t)  =  E  [xj(t)  I  Zj(4) ,  to  ^  ^  S  i] 

which  evolves  as 

ij(t)  =  Aj(t)i;/t)  +  K^(t)uJ(t),  (2.9a) 

Oj‘‘(t)  =  Zj(t)  =  H,(t)xj(t) , 

Kj(t)  =  Pj(t)H]'(t)R;'(t). 

Here  Kj(t)  is  the  decentralized  Kalman  gam  at  the  j-th  node.  The  error  covariance  matrix 
Pj(t)  can  be  computed  from  either  of  the  following  forms  of  the  Riccati  equanon: 

P/t)  =  Aj(t)Pj(t)  +  P/t)Aj(t)  +  Q/t)  -  Pj(t)c]'(t)R;'(l)C,(t)Pj(t)  (2.9b) 

^(p;'(t)) = -p;‘(t)A/t)  -  Aj(t)p;'(t)  -  p;'(t)Qj(t)p;'(t) 

+  H}(t)R;’(t)H,(t)  (2.9c) 

with  the  initial  condition 
P/to)  =  S/to) 

Equation  (2.9a)  further  reduces  to 

Xj(t)  =  Fj(i)Xj(t)  +  Kj  (t)zj(t)  (2. 10a) 

where 

F/t)  =  Aj(t)  -  Pj(t)H]‘(t)R7'(t)H/t)  =  Aj(t)  -  Kj(t)Hj(t) .  (2. 10b) 
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The  local  estimator  for  the  j-th  node  is  shown  in  Figure  2.2. 

Relationship  Between  the  Local  and  Global  Estimates 

To  find  the  connection  between  the  two  estimates,  we  will  use  the  relationship 
Cj(t)  =  Hj(i)Mj(t)  between  the  local  and  global  models.  This  relationship  has  been 
descnbcd  earlier  in  (2.3)  However  using  this  relationship,  the  global  estimate  of  (2.8a) 
can  be  written  as 

J 

i(t)  =  F(t)i(t)  +  ^P(t)Mj(t)H}(t)R;'(t)z/t) .  (2.na) 

1=1 

From  (2.10)  we  also  have 

P/t)Hj(t)R;'(t)2/t)  =  j/t) -  F/t)J/t) .  (2.nb) 

Using  (2. 1  ib)  in  (2.1  la),  we  obtain  as  the  global  estimate 
J 

x(t)  =  F(t)x(t)  +  ]^Gj(t)  [x{t)  -  Fj(t)Xj(t)) ,,  (2,1 2a) 

1=1 

where 

Gj(t)  =  P(t)Mj(t)P;‘(t).  (2.12b) 

By  straightforward  manipulation,  it  can  be  shown  that 

J  1  J 

x(0  =  x(to)  -  ^Gj(to)xj(to)  +^G/t)Xj(t) 

1=1  1=1 

1  I 

X  f  .  (2.13a) 

where  <>F(t,  x)  is  the  transition  matrix  associated  with  the  dynamics  matrix  F(t)  and 
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Kj(t)  =  F(t)Gj(t)  -  Gj(t)  -  Gj(t)Fj(t) 

=  [p(t)Mj'(t)p;’(t)Qj(t)p;*(i)-Q(OMj(t)p;’(t)] 

+  (P(t)Mj(t)A]'(t)P;'(t)  -  P(t)A’'(t)Mj(t)P7'(t) 

-P(OMj(t)P;'(t)]  (2.13b) 

The  indicated  inverse  of  the  matrices  are  assumed  to  exist.  However  this  is  the  key 
equation  relating  the  local  and  global  estimates.  This  equation  also  shows  the  effect  of 
initial  conditions  at  various  sensors  on  the  global  estimate.  It  is  obvious  from  (2.13a)  that 
x(t)  can  also  be  generated  dynamically  from  the  following  equations: 

J  J 

^(t)  =  F(t)^(t)  +  ^K,(t)i/t);  ^(to)  =  x(to)-XG/*o)Xj(to)  (2-14a) 
1=1  j=i 

J 

x(t)  =  ^(t)  +  ^Gj(t)Xj(t)  (2. 14b) 

1=1 

These  equations  were  first  established  by  Willsky  et  al.  (1982).  Equations  (2.14)  clearly 
reveal  the  structure  of  the  decentralized  esamation:  the  measurements  at  the  j-th  sensor  is 
processed  locally  to  obtain  the  optimal  esomate  xXt)  which  is  then  sent  to  the  cenu^lized 
coordinator,  the  coordinator  then  constructs  x(t)  dynamically  according  to  (2  14).  Notice 
that  all  the  matrices  in  (2.14)  are  precomoutable  and  can  be  stored  aprion.  The  block 
diagram  representing  these  equations  ate  shown  in  Figure  2.3. 

It  can  be  seen  from  Figure  2.3  that  there  is  a  considerable  amount  of  computational 
burden  at  the  central  node  and  the  structure  of  the  combining  algcnthm  is  rather  complex 
The  computational  burden  and  the  complexity  of  the  centralized  coordinator  can  be  reduced 
as  follows.  Define  hj(t)  as  a  data  dependent  vector  for  the  j-th  node  which  evolves  as 

h/t)  =  F(t)hj(t)  +  Kj(t)Jj(t)  ,  f2.15a) 
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with  initial  condition 


hj(io)  =  -<5j(to)xj(to).  (2.15b) 

The  global  estimate  then  simplifies  to 

j 

x(t)  =  (i)F(t,  to)J(to)  +  ■  (2.15c) 

1=  1 

This  fonn  was  first  obtained  by  Speyer  (1979)  for  a  simplified  case  when  the  global  model 
IS  the  same  as  the  local  model  We  will  discuss  this  case  in  the  next  section.  The 
implementation  of  this  form  is  shown  in  Figure  2.4.  In  this  scheme,  each  node  j  must  send 
two  vectors  -  x^(t)  and  hj(t)  to  the  central  node  which  then  constructs  the  global  estimate 
x(t).  Notice  also  that  the  computational  burden  of  the  central  node  has  been  reduced 
considerably  but  only  at  the  expense  of  the  additional  complexity  and  burden  at  the  local 
nodes.  By  simple  block  diagram  manipulation  of  Figure  2.4,  the  complexity  of  the  central 
node  can  be  reduced  further.  To  see  this  mathematically,  define  a  node-dependent  dynamic 
vector  T|j(t)  which  is  generated  as: 

Sj(t)  =  F(t)s/t)  K/t)x,(t) ;  s/to)  =  -G/to).x,(to)  (2. 1 6a) 

Tij(i)  =  Sj(t)  +  Gj(tlXj(t)  (2. 16b) 

Clearly,  then  the  global  estimate  is  given  by 
J 

x(t)  =  9=(t,  t(,)x(to)  +  X  1 6c) 

j=i 

which  can  be  implemented  as  in  Figure  2.5.  Notice  that  in  this  scheme,  only  one  vector 
Tij(t)  is  to  be  sent  to  the  central  node  In  contrast  to  2  vectors  in  the  earlier  scheme  of  (2. 15). 
Thus,  the  transmission  requirement  has  been  reduced  by  50%.  If  x(to)  =  0,  then  the  central 
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node  IS  simply  an  adder,  it  merely  adds  the  incoming  vectors  from  the  various  nodes. 
Notice  also  that  the  complexity  and  the  computational  burden  of  the  central  node  has  been 
reduced  to  minimum  but  only  by  burdening  each  of  the  sensor  nodes  with  these  problems. 


Equation  (2.15)  can  be  manipulated  further  to  reduce  the  eomplexity  of  the  local 
processors  as  follows  Define 

qj(t)  =  Gj(t)Xj(t)  +  hj{t)  (2.17a) 

where  the  evolution  of  h.(t)  with  its  initial  condition  is  given  in  Equations  (2.15a)  and 


(2. 1 5b)  respectively.  Since  hj(to)  =  -Gj(lo)Xj(lo),  the  initial  condition  of  q.(t)  is 

q/<o)  =  0  (2.17b) 

The  global  estimate  can  be  wrinen  m  terms  of  q/t)  using  (2.15c): 

J 

x(t)  =  $F(t,  to)x(to)  +  Iq/l). 

)=>  (2.17c) 

A  recursion  for  q^ft)  can  be  found  as  follows;  Differentiating  both  sides  of  (2.17a)  gives 

q/t)  =  Gj(t)xj(t)  +  Gj(t)«t)  +  h/t) .  (2.17d) 

Using  Equations  (2.9a)  and  (2.15a)  in  the  above  expression,  we  get 

q/t)  =  [G/t)  -t  Gj(t)F/t)  +  K/i)lXj(t)  +  G/t)P/t)H]'(t)R;'(t)z/t) 

+  F(t)h/t) .  (2-17e) 

Finally,  using  expression  for  Kj(t)  from  (2.13b)  in  (2.17e)  and  simplifying,  we  have, 

qj(t)  =  F(t)q/t)  +  Kj(t)z/t)  (2.170 
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where 

The  decentralized  estimation  scheme  given  in  (2.17)  above  is  shown  Figure  2.6. 
This  scheme  may  be  compared  with  the  one  in  Figure  2.5.  Notice  that  the  complexity  of 
the  local  processor  has  been  reduced  and  yet  the  transmission  requuement  has  remained  the 
same  each  local  processor  need  to  send  to  the  mixing  node  only  one  data  dependent  vector 
qj(t)  of  the  dimension  of  the  true  (global)  state. 

An  important  remark  is  in  order  here.  The  fact  that  the  sensors  are  uncorrelated  has 
played  a  crucial  role  in  developing  the  decentralized  scheme  here.  It  is  the  block  diagonal 
structure  of  the  R(t)  matrix  which  has  enabled  us  to  decompose  the  centralized  estimation 
scheme  of  (2.5)  into  additive  form  of  (2.7).  The  centralized  Kalman  gain  K'(t)  can  be 
partinoned  as  K'(t)  =  [Ki(t),  K2(t) ...  Kjft))  each  block  of  which  corresponds  to  a  sensor. 
When  the  sensors  are  correlated,  i.e.  the  R(t)  matrix  is  of  "full”  structure,  it  is  not  clear 
how  to  decompose  the  global  esdmation  problem  into  various  subproblems. 

2.3  The  Special  Case  of  Identical  Local  and  Global  Models 

In  this  section  we  consider  a  special  case  of  the  more  general  formulation  of  the  last 
secnon.  Specifically,  we  assume  that  the  models  used  by  the  local  processors  are  identical 
to  the  global  model.  This  case  was  first  examined  by  Speyer  (1979).  Under  this 
assumption 

A/t)  =  A(t),  Qj(t)  =  Q(t),  Cj(t)  =  H,(t),  Mj(t)  =  I  forallj.  (2.17) 

In  this  case,  the  expression  for  Kj(t)  of  Equation  (2.13b)  simplifies  to 


I 
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K/t) = p(t)p;'(t)Q(t)p;‘(t)-Q{t)p;'(t) 
=  [p(t)p;'(t)-i]  Q(t)p;'(t). 


(2.18) 


Since  Mj(t)  =  I,  we  have  set  Mj(t)  =  0  to  find  the  above  expression.  The  other  relevant 
variables  for  this  case  are 


Fit)  =  A(t)  -  ]^P(t)c}(t)R;'(t)C,(t) 

j  =  i 

F,(t)  =  A(t)  -  Pj(t)cJ(t)R;'(t)C/t) ; 
G/t)  =  P(t)P;‘(t); 

K‘(t)  =  P(t)cJ(t)R;'(t); 

K^(t)  =  Pj(t)C]'(t)R;‘(t). 


(2.18b) 


An  imponant  consequence  of  this  simplification  is  that  the  global  estimate  of  (2.14b)  can  be 
wntten  as 

j 

x(t)  =  ^(i)  +  P(t)^Pj’(t).\/t)  (2.19) 

j=i 


Note  that  the  second  term  in  the  expression  for  x(t)  is  the  usual  expression  for  combining 
independent  estimates  However  Xj(t)  are  not  independent  in  general,  and  ^(t)  represents  a 
dynamic  correction  for  this  conelation. 


2.4  Summary 

In  this  chapter,  we  have  formulated  and  solved  the  decentralized  estimation  problem 
for  the  continuous  time  case.  Wc  have  primarily  followed  the  work  of  Willsky  et  al. 
(1982)  to  derive  the  results  of  this  chapter.  The  assumption  that  the  sensors  are 
uncorrelated  has  played  a  cruaal  role  in  developing  these  algorithms. 
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The  multisensor  data  processing  problem  can  be  solved  in  two  ways.  All  the  sensor 
data  can  be  transmitted  to  a  central  node  and  can  be  processed  for  globally  optimum 
estimates.  .Mternatively,  each  sensor  data  can  be  processed  locally  for  a  locally  optimum 
esnmate  which  then  can  be  transmitted  to  the  central  node.  The  central  node  combines  all 
the  local  estimates  to  obtain  the  global  estimate.  The  later  scheme  is  the  decentralired 
estimation  and  is  preferable  from  the  viewpoint  of  system  survivability.  In  Section  2.2,  a 
generalized  decentralized  estimation  scheme  has  been  formulated.  Here  we  have  assumed 
that  each  of  the  local  processors  may  possibly  have  a  different  model  which  again  is  not 
necessanly  the  same  as  the  global  model.  Several  possible  decentralized  schemes  have 
been  presented  in  Figures  2.3  -  2.5.  In  the  scheme  of  Figure  2.3,  the  local  estimates  are 
sent  to  the  mixing  node  which  in  turn  produces  the  globally  optimum  estimate  dynamically. 
The  structure  of  the  mixing  node  is  complex  in  this  case.  The  complexity  of  this  node  can 
be  reduced  by  transferring  some  of  its  computational  burden  to  the  local  nodes  as  shown  in 
Figure  2.4.  But  in  this  case,  each  local  node  must  send  2  vectors  to  the  mixing  node.  The 
transmission  requu’ement  of  this  scheme  can  be  reduced  to  one  vector  by  adding  additional 
computations  at  the  local  nodes  as  shown  in  Figure  2.5.  The  results  of  this  section  show 
that  the  complexity  and  the  computational  burden  of  the  central  node  can  be  traded  by  those 
of  the  local  nodes.  Finally,  in  Figure  2.6,  the  complexity  of  the  local  processor  has  been 
reduced  without  sacrificing  the  transriussion  efficiency. 

In  Section  2.3,  a  specialized  case  has  been  dealt  with  where  the  local  models  are 
assumed  to  be  the  same  as  the  a  global  model.  This  case  was  first  analyzed  by  Speyer 
(1979).  Under  this  assumpnon,  considerable  simplification  occurs  in  the  algonthms  and  in 
the  structure  of  the  local  and  central  processors. 
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centralized  cowdinator 


Figure  2.1:  Centralized  Estimation 


sensor J: 


x,(t)  , - L 


Figure  2.3:  Decentralized  Estimation:  Basic  Structure 


Figure  2.4: 

Decentralized  Estimation 
-  A  Simpler  Central  Node 


centralized  cooidmalor 


Figure  2.5:  Decentralized  Estimation 
•  Reduced  Transmission  Requirement 


CHAPTER  3 


DECENTRALIZED  ESTIMATION:  DISCRETE  TIME  VERSION 

3.1  Introduction 

The  decentrahzed  estimation  problem  for  the  continuous  time  case  was  analyzed  in  the 
last  chapter  In  this  chapter,  the  corresponding  problem  for  the  discrete  time  case  will  be 
presented.  This  problem  was  first  analyzed  by  Speyer  (1979),  but  for  the  special  case  of 
identical  global  and  local  models.  In  this  chapter,  we  intend  to  extend  his  results  to  a  more 
general  case  of  non-identical  global  and  local  models.  Some  of  these  results  have  already 
been  submitted  before  in  various  progress  reports. 

The  mouvation  for  the  decentralized  estimation  problem  and  vanous  underlying 
assumptions  behmd  the  problem  formulation  have  already  been  presented  in  Chapter  2  and 
will  not  be  repeated  here.  Ihis  chapter  will  be  a  brief  one;  we  shall  only  present  the 
corresponding  results  of  the  Chapter  2  following  the  same  sequence  of  that  chapter.  The 
purpose  of  this  chapter  is  twofold:  first,  for  the  sake  of  completeness  of  this  report  and 
second,  to  expose  some  implementation  details  that  are  distinct  from  the  continuous  time 
case.  Explanations  and  interpretation  of  various  results  are  same  as  in  Chapter  2. 

This  chapter  is  organized  as  follows.  In  Section  3.2  the  decentralized  estimation 
problem  will  be  formulated  and  the  computational  and  transmission  requirement  of  each 
local  processor  will  be  established.  As  in  the  last  chapter,  we  assume  that  each  local 
processor  has  two  alternatives:  eitlier  it  can  send  the  raw  observation  data  to  a  central 
(mixing)  node  where  the  globally  optimum  estimate  will  be  computed  or  it  can  process  the 
raw  data  for  the  locally  optimum  (or  some  kind  of)  estimate  which  are  then  transmitted  to  a 
mixing  node.  The  local  esnmates  are  then  fused  at  the  mixing  node  for  the  global  estimate. 
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We  assume  that  the  local  processing  is  always  a  superior  alternative  than  the  centralized 
processing.  The  reasons  are  given  in  Chapter  2.  In  this  section,  we  shall  present  several 
schemes  for  local  processing  corresponding  to  those  m  Chapter  2.  In  Section  3.3,  we  shall 
simplify  the  denvations  of  Section  3.2  for  the  special  case  of  identical  and  global  models  - 
this  case  was  analyzed  by  Speyer  (1979).  Finally,  some  conclusions  are  provided  in 
Section  3.4. 

3.2  Decentralized  Estimation  and  Transmission  Requirements 

As  in  the  last  chapter,  we  assume  a  scenario  where  multiple  sensors  are  tracking  a 
single  SDI  target.  These  sensors  will  also  be  referred  to  as  nodes,  local  agents  or  local 
processors.  The  target  model  is  called  the  global  or  central  model  and  the  goal  is  to 
esnmate  the  target  state  from  all  the  sensor  measurements.  In  this  section,  we  assume  that 
the  agents  may  not  possibly  have  the  full  knowledge  of  the  global  model.  Instead,  these 
processors  will  have  their  own  mode!  (also  known  as  local  model)  about  the  target  state 
which  they  will  use  to  estimate  the  target  states.  These  estimates  are  then  transmitted  to  a 
fusion  center,  also  known  as  the  mixing  node,  where  all  the  local  csUuiaies  ate  combined  to 
obtain  the  global  estimate. 

Suppose  there  are  J  sensors.  The  global  and  local  models  are  as  follows. 

Global  Model 

x(i  +  1)  =  A(i)  x(i)  +  w(i) ;  i  S  ij 

Zj(i)  =  Cj(i)  x(i)  Vj(i) ;  j  =  1,2,...J 

Vanous  statistics  about  this  model  are  as  follows; 

E  [x(i  o)]  =  x(i  o) :  E  [(x(i  o)  -  x(i  g) )  (x(i  q)  -  x(i  q)  )^]  =  E(i  q) 


(3.1a) 

(3.1b) 
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E  [w(i)]  =  0  ;  Elw(i)w^(m)]  =  Q(i)8,  „ 

E  [vj(i)]  =  0 ;  Elv/OvkCm)]  =  R/i)S,  ;  R/i)  =  rJO)  >  0  . 

We  also  assume  that  w(i)  is  independent  or  xCig)  and  of  Vj(i)  for  all  j=l,2,...J  and 


Vj(i)  is  independent  of  x(io). 

Local  Models 

Xj(i  +1)  =  Aj(i)Xj(i)  +  Wj(i) 

(3.2a) 

Zj(i)  =  Hj(i)xj(i)  +  Vj(i) ;  j=l,2,...J 

(3.2b) 

E  [xj(i  o)]  =  Xj(i  o) ;  E[(Xj(i  q)  -  Xj(i  o))  (x/i  o)  “  Xj(i  o) 

E  ( Wj(i)]  =  0 ;  E[ Wj(i)w^(ni)l  =  Qj(i)8,  n,8jit 
E  (v,(i)]  =  0 :  E[vj(i)vJ(m)!  =  Rj(i)8, 

As  before,  we  assume  that  Vj(i)  is  independent  of  Xj(io)  and  of  Wj(i)  for  all  j  and  i. 
Further,  for  the  well  posedness  of  the  problem,  we  assume  the  existence  of  a  sequence  of 
matrices  (Mj(i))  such  that 

C/i)  =  H/i)Mj(il ;  j=l,2,...J  (3.3) 

General  Solution  Without  Decentralized  Considerations 

The  globally  optimal  filtered  state  using  all  the  information  is  given  by 

x(ili)  =  E[x(i)lzi(Q,  Z2(P,  ...2j(y  ;  ioSCSi  )  (3.4a) 

which  can  be  mechanized  by  a  central  node  if  it  receives  all  the  sensor  data.  In  order  to 

A 

evaluate  x(i  I  i ),  define 
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z(i)  = 

Then  all  the  sensor  data  can  be  compactly  written  as 

z(i)  =  C(i)x(i)  +  v(i).  (3.4c) 

Therefore,  the  optimal  filtered  state  condmonal  on  the  measurement  history  is 
x(ili)  =  E(x(i)  I  z(C);io^CSil 
which  evolves  as  (Bryson  and  Ho  (1975)] 


x(tli)  =  x(i  li-l)  +  K'(i  )u'(i); 

(3.5a) 

u'(i)  =  z(i)-C(i)x(ili-l); 

(3  5b) 

K‘(t)  =  P(ili)c'^(i)R''(i)  . 

(3.5c) 

x(i+lli)  =  A(-.)x(ili); 

where  R(i)  is  the  covanance  matrix  associated  with  v(i)  that  has  been  defined  in  (3.4b)  and 
K‘^(i)  IS  the  centralized  Kalman  gam.  The  covariance  matrix  P(i  I  i)  is  the  solution  of  the 
Riccan  equation  and  follows  from  the  following  recursion: 


P'‘(i  1  i )  =  P''(i  1  i  -1)  +  d(i)R''(i)C(i) ; 

(3.5d) 

P(i  +  !li)  =  A(i)P(ili)A'^(i)  +  Q(i); 

(3.5c) 

P(iolio-l)  =  I(io)- 

R(i)  is  defined  as 

Zi(i)' 

■C,(i) 

‘vi(i)' 

Z2(i) 

,  C(i)  = 

C2(i) 

;  v(i)  = 

V2(l) 

z/i) 

Ci(i) 

Vj(l) 
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(3.6a) 


I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

B 

I 


E[v(i)v^(m)]  =  R(i)5,„. 

Since  we  are  considering  the  case  of  uncoirelated  sensor  noise, 

R(i)  =  block  diag  {Rj(t)}  (3.6b) 

where  Rj(i)  is  defined  in  (3.1)  and  (3.2)  above. 

(3.5a)  can  be  simplified  further  as  follows: 

x(i  +lli  +1)  =  x(i  +lli )  +  K'(i  +l)[z(i  +1)  -  C(i  +l)x(i  +lli )]  (3.7a) 

=  F(i  +l)x(i  I  i )  +  K'(i  +l)z(i  +1)  (3.7b) 

where 

F(i)  =  F(i)A(i"l);  (3.7c) 

J 

F(i)  =  I  -^P(i  I  i  )cJ(i)R;'(i)C/i) ;  (3.7d) 

1=1 


and  K'(i)  is  defined  in  (3.5c).  A  block  diagram  of  this  centralized  scheme  is  shown  in 
Figure  3.1. 

Using  the  block  diagonal  structure  of  the  R(i)  matrix,  (3.7a)  simplifies  to 

I 

J(i  +11 1  +1)  =  F(t  +l)iJ(i  1 1 )  +XKj(i  +l)Zj(i  +1)  (3.8a) 

j=i 

where 

Kj(i)  =  P(i  I  i  )cJ(i)R3"'(i) ;  (3.8b) 
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(3.8c) 


j 

P‘'(i  1 1 )  =  P‘'(i  I  i  -1)  +  5^Cj(i)R;'(i)Cj(i) . 

J=J 

Notice  that  using  (3.8c),  F(i)  of  (3.7d)  can  be  wntien  as 

F(i)  =  P(ili)P''(ili-l).  (3.8d) 

Decentralized  Estimation  Using  Local  Data  Processing 

Now  consider  the  local  model  of  Equation  (3.2).  The  local  estimate  at  the  j-th  node  is 
given  by 

x.(i  1 1 )  =  E(xj(i)  i  ;  i  0  S  ^  <  i } ;  (3.9a) 

which  IS  propagated  as 

X/i  I  i )  =  'xj(i  I  i  -1)  +  Kj(t)u;'(i) ;  (3.9b) 

\)J(i)  =  Zj(i)  -  Hj(t)xj(i  I  i  -1) : 

Kj(i)  =  Pj(ili)Hj'(i)R;'(i): 

Xj(i  +1  I  i )  =  Aj(i).Xj(i  I  i  )  . 

Here  Kj(t)  is  the  decentralized  Kalman  gain  associated  with  the  j-th  sensor.  The  error 
covariance  matrix  Pj(i  I  i )  follows  the  recursion 

p;'(i  I  i )  =  p;'(i  I  i  -1)  -t-  H[(i)R;'(i)Hj(i) ;  (3.9c) 

Pj(i  +1 1  i )  =  A/i)Pj(i  I  i  )Aj(i)  -I-  Q/i) ; 

P/iolio-l)  =  2/io). 

(3.9b)  can  be  simplified  further 
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Xj(i  +lli  +1)  =  Fj(i  +l)Xj(i  I  i )  +  Kf(i  +l)Zj(i  +1)  (3.10a) 

where 

F/i)-^Fj(i  )Aj(i-l) 

Fj(i)  =  I  -  P/ 1 1  i  ,H]'(i)R;*(i)Hj(i) 

=  Pj(!li)P;'(ili-l)  (3.10b) 

and  the  last  equality  has  been  derived  from  (3.9c).  The  local  estimator  for  the  j-th  node  is 
shown  in  Figure  3.2. 

Relationship  Between  the  Global  and  Local  Estimates 

Using  the  relationship  Cj(i)=Hj(i)Mj(i)  in  (3.8a),  we  have  the  global  estimate  as 

'x(i+lli+i)  =  F(i+l)x(ili) 

J 

+]£p(i  +11  i  TDMjfi  +l)H]'(i  +l)R;’(i  +l)z/i  +1)  (3.11b) 

Rearranging  (3. 10a)  after  substituting  the  value  of  +  1 )  gives 

H^fi  +l)R'‘(i  +l)Zj(i  +1)  =  P7‘(i  +11  i  +l)[x/i  +11  i  +1) 

-F/i+l)xj(iii)]  (3.11b) 

Now,  using  (3.1  lb)  in  (3.1  la),  we  get 

I 

x(i+l  li+l)  =  F(i+l)x(ili)+^P(i  +1 1  i  +l)Mj'(i+l) 
j=i 

X  P7‘(i  +1 1  i  +l)lxj(i  +1 1  i  +l)-F/i  +l)Xj(i  I  i )  (3.1  Ic) 

Since  Xj(i  I  i )  =  A7Vi)Xj(i  +11  i ),  we  have 
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Fj(i  +l)Xj(i  i  1 )  =  Fj(i  +l)Aj  (i)ij(i  +11  i ) 

=  Fj(i+l)Xj(i+lli) 


=  Pj(i  +1  !  i  +l)P“*(i  +lli  )Xj(i  +11  i ) . 


(3.12a) 


Using  (3.12a)  in  (3.1  Ic)  gives  the  global  estimate  as 

J 

x(i  +1  1 1  +1)  =  F(i  +l)x(  i  I  i )  +  ]^P(i  +1 1  i  +l)Mj'(i  +l)(P;'(i  +1 1  i  +1) 


X  i/i+l  I  i  +1)-  P;’(i+1 1  i  )Xj(i  +11  i )) .  (3.12b) 

This  expression  suggests  that  x(i  I  i )  is  of  the  form 
j 

x(i  I  i )  =  ^[P(i  I  i  )Mj(i)Pj'(i  I  i  )Xj(i  I  i )  +  hj(i)]  (3.13a) 

where  hj(i)  is  a  node-dependent  vector  of  the  dimension  of  the  state.  This  implies 
J 

x(i  +1 1  i  +1)  =  ^[P(i  +1 1  i  +l)Mj(i  +l)P;'(i  +11  i  +l)Xj(i  +1  I  i  +1) 
j=i 

+  h/i+l)]  (3.13b) 

A 

To  find  a  recursion  for  hj(i  +1),  (3.13a)  is  substituted  in  (3.12b)  for  x(ili)  and  then 
comparing  it  with  (3.13b),  we  obtain 

h/i  +1)  =  F(i  +l)hj(i)  +  F(i  +l)P(i  I  i  )M]'(i)P;*(i  I  i  )X/i  1 1 ) 

—  P(i  +1 1  i  +l)Mj(i  +l)Pj  ^(i  +1 1  i  )Xj(i  +1 1  i )  j^c) 

Since  Xj(i  I  i )  =  A3"'(i)xj(i  +1 1  i ),  the  above  equation  can  be  compactly  written  as: 

hj(i  +  1)  =  F(i  +l)hj(i)  +  Gj(i  +l)Xj(i  +11  i )  (3.14a) 

where 
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F(i+l)  =  F(i  +l)A(i), 

=  P(i  +11  i  +l)P"’(i  +11  i  )A(i) :  (3.14b) 

G(i  +1)  =  P(i  +11  i  +l)P‘'(i  +11  i  )A(i)P(i  I  i  )M,'^(i)P;'(i  1 1 ) 

xA;‘(i)  -P(i+lli+l)M^(i+l)P;‘(i+lli).  {3.14c) 

We  have  used  (3.7c)  and  (3.8d)  in  establishing  (3.14b)  v-hich  in  turn  has  been  used 
to  find  (3. 14c).  We  have  assumed  that  the  inverse  for  the  appropnate  mamces  exists  in  the 
above  expression  The  initial  condition  for  h^fi)  can  be  evaluated  by  solving  (3.13a)  for 

1=  iQ. 

The  complete  scheme  for  generating  the  global  estimate  from  the  local  ones  is 
summarized  as  follows: 

hj(i  +1 )  =  F(i  +l)hj(i)  +  G^(t  +1  )Aj(i)Xj(i  I  i ) 

J 

'i(i  I  i )  =  '  >  )M]'(i)P;'(i  1 1  )xj{i  1 1 )  +  hj(i)) 

p! 

The  block  diagram  corresponding  to  these  equations  arc  shown  in  Figure  3.3.  Notice 

A 

that  each  of  the  local  processors  must  send  2  vectors:  x(i  I  i )  and  hj(i)  to  the  central  node. 

The  transmission  requirement  can  be  reduced  if  some  of  the  computational  burden  of 
the  central  processor  is  transferred  to  the  local  ones.  For  example,  if  we  define 

q,(i)  =  P(i  i  i  )My(i)P''(i  I  i  )Xj(i  I  i )  +  hj(i)  (3.16a) 

then 

J 

x(ili)  =  Xilj(i)  (3.16b) 

pi 


(3.15a) 

(3.15b) 
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I - - - 

and  the  j-th  local  agent  need  to  send  only  this  vector  to  the  mixing  node  -  thus  the 
transmission  requirement  has  been  reduced  to  50%.  Since  all  the  variables  in  (3.16a)  are 
generated  at  the  node,  Tij(i)  can  be  constructed  locally  at  that  node.  The  evolution  of  T|j(i) 
can  be  established  as  follows: 

Tij(i  +1)  =  P(i  +11  i  +l)Mj'(i  +l)P"*(i  +11 1  +l)J/i  +11  i  +1)  +  hj(i  +1)  (3.i7a) 

Using  the  value  of  hj(i  +1)  from  (3.13c)  in  this  expression  and  after  reanangement  gives 

Tij(i  +1)  =  F(i  +l)[hj(i)  +  P{i  1  i  )Mj(i)P;‘(i  I  i  )x/i  1 1 )) 

+  P(i  +11  i  +l)Mj(i  +l)tP7‘(i  +11  i  +l)J/i  +11  i  +1) 
-P;‘(i+lli)Jj(i+lli)].  (3.17b) 

But  the  expression  in  the  second  pair  of  brackets  is 

P;'(i  +11  i  +l)xj(i  +11  i  +1)  -  p;'(i  +11  i  )xj(i  +11  i ) 

=  [P;'(i  +1  h  +1)  -  P;'(i  +11  i  +l)Kj(i  +l)Hj(i  +1)  -  P;'(i  +11  i  )]Xj(i  +11  i ) 

+  P;'(i+lli+l)Kj(i+l)z/i+l) 

=  K'(i+l)Zj(i+l); 

Kj'(i  +1)  =  P(i  +11 1  +l)cj(i  +l)R;'(i  +1) .  (3.17c) 

Therefore  (3.17a)  becomes 

Tij(i+l)  =  F(i+l)Tij(i)  +  Kj(i+l)Zj(i+l) .  (3.17d) 

The  initial  condition  forTij(i)  is  obtained  from  (3.16a): 

tlj(i  qI  =  P('  0  0  0  ^1 0^  0^  (3. 1 7e) 

The  block  diagram  of  this  scheme  is  given  in  Figure  3.4.  Notice  that  the  explicit 
appearance  of  the  local  estimator  has  been  totally  eliminated.  The  structure  of  the  central 
node  IS  very  simple  -  it  simply  adds  the  incoming  vectors. 
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3.3  The  Special  Case  of  Identical  Local  and  Global  Models 

In  this  section  we  consider  the  case  examined  by  Speyer  (1979)  Specifically,  w'e 
assume  that  the  models  use.d  by  the  local  processors  are  identical  to  the  global  model  That 

IS, 

A  =  Aj,  Q  =  Qj,  Cj  =  Hj,  Mj  =  1 .  (3.18) 

In  this  case,  the  filtered  estimate  is  generated  as  follows: 

hj(i  +1)  =  F(i  +l)hj(i)  +  Gj(i  +l)A(i)Xj(i  I  i )  (3.19a) 

J 

x(i  I  i )  =  ]^(P(i !  i  )P;’(i  I  i  )x/i  I  i )  +  h/i)]  (3.19b) 

J=i 

where,  as  before, 

F(i  +1)  =  P(i  +11  i  +l)P'‘(i  +11 1  )A(i) 

G(i  +1)  =  P(i  +11  i  +l)P‘'(i  +11  i  )A(i)P(i  I  i  )P;’(i  I  i  )A‘’(i) 
-P(i+lli+l)P;’(i'-lli). 

A 

The  other  scheme  of  generating  x(i  1 1 )  using  Tij(i)  remains  unchanged  in  this  special  case 
and  is  identical  to  (3.16). 

3.4  Summary  and  Conclusion 

The  decentralized  estimanon  scheme  for  the  discrete  time  case  has  been  presented  in 
this  chapter  whereas  that  for  the  continuous  time  case  was  presented  in  Chapter  2.  In 
Section  3  2  the  problem  has  been  formulated  in  a  more  general  setting;  here  we  have 
assumed  that  the  local  processor  models  are  not  necessarily  the  same  as  the  true  global 
model  except  that  the  sensor  noise  statistics  is  same  for  both  the  models.  The  globally 
optimum  filtered  states  of  the  target  can  be  constructed  as  follows,  raw  observed  data  from 
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vanous  sensor  locations  can  be  transmitted  to  a  central  (mixing)  node  where  all  the  data  can 
be  processed  for  the  global  estimate.  Alternatively,  raw  observed  data  can  be  processed 
locally  and  then  the  local  estimates  are  sent  to  the  mixing  node  where  these  are  combined 
into  the  global  estimate.  Throughout  this  report  we  have  emphasized  that  transmitting 
processed  data  from  the  sensor  locations  to  a  mixing  node  is  superior  to  transmitting  raw 
data.  Along  this  line,  we  have  shown  in  this  section  that  the  globally  optimal  estimate  can 
be  constructed  at  a  central  node  from  two  processed  data  vector  from  each  of  the  local 
processors,  these  are  locally  optimum  estimate  vector  and  another  data  dependent  vector 
that  evolves  recursively  according  to  Equation  (3.14a).  This  scheme  is  shown  in  Figure 
3  3.  This  scheme  can  be  improved  further,  the  transmission  requirement  can  be  reduced 
by  50%  by  transferring  some  of  the  computational  burden  of  the  central  processor  to  the 
local  ones.  In  this  improved  scheme,  each  local  processor  need  to  send  only  one  data 
vector  to  the  central  node,  this  data  vector  is  generated  recursively  at  the  local  node 
according  to  Equation  (3.17d).  This  scheme  is  shown  in  Figure  3.4.  The  case  of  identical 
global  and  local  models  is  dealt  with  in  Section  3.3  -  it  is  observed  that  much 
simplifications  do  not  occur  in  this  case. 
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Figure  3.1:  Centralized  Estimation 


sensor  J. 


Figure  3.2:  Local  Estimator:  j-th  Sensor 
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F(i  +  i: 


nuxing  node 


Figure  3.4:  Decentralized  Estimation:  A  Simpler  Local  Processor  Structure 
and  Reduced  Transmission 


CHAPTER  4 


DECENTRALIZED  ESTIMATION  IN  PRESENCE  OF  CORRELATED 
SENSOR  NOISE 

4.1  Introduction 

In  Chapters  2  and  3,  the  decentralized  estimation  problem  was  solved  for  the  case  of 
uncorrelated  noise.  In  this  chapter,  a  simpler  version  of  the  same  problem  will  be  solved 
for  the  case  of  correlated  sensor  noise.  Because  of  the  imponance  of  this  problem,  we 
have  tned  to  make  this  a  self  contained  chapter. 

The  same  scenano  as  that  of  the  previous  chapters  will  be  adopted  here:  multiple 
sensors  that  are  possibly  located  at  dispersed  geographical  locations  are  observing  a  single 
SDI  target.  Attached  with  each  sensor  is  a  data  processing  capability.  A  sensor  along  with 
Its  own  data  processing  capability  is  also  known  as  a  local  agent  or  a  local  processor.  The 
true  state  of  the  target  is  also  known  as  the  global  state  and  the  ultimate  goal  is  to  obtain  the 
optimal  estimate  of  this  state  by  utilizing  all  the  sensor  measurements.  The  dynamic  model 
of  the  target  state  may  or  may  not  be  available  to  the  local  agents.  We  have  discussed  in  the 
previous  chapters  that  the  global  estimation  problem  can  be  solved  in  two  altemauve  ways. 
First,  all  the  sensors  can  transmit  'heir  raw  observations  to  a  central  node  (also  known  as  a 
fusion  center  or  mixing  node)  to  form  an  augmented  observation  vector  and  a  centralized 
Kalman  filter  can  then  extract  the  optimal  state  from  this  vector.  Alternatively,  each  local 
agent,  by  utilizing  its  ov/n  model  about  the  global  state,  can  process  its  own  observations 
on  site  to  form  the  local  optimal  estimate  conditional  on  us  measurement  history  and  then 
transmit  this  estimate  to  the  fusion  center.  In  the  fusion  center,  the  incoming  estimates 
from  various  nodes  are  combined  in  an  appropriate  fashion  to  construct  the  globally  optimal 
estimate  -  this  is  known  as  a  decentralized  or  distributed  esnmation  scheme.  It  has  been 
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mentioned  earlier  that  the  latter  alternative  is  superior  from  the  system  survivality 
considerauons  In  the  centralized  scheme,  the  system  performance  is  lost  completely  in  the 
event  of  the  central  mode  failure  On  the  other  hand,  in  the  decentralized  scheme,  a 
graceful  degradation  of  the  system  performance  takes  place  in  the  event  of  a  node  failure 
In  order  to  build  an  ideal  redundancy  into  the  system,  each  of  the  local  nodes  should  be 
designed  as  a  fusion  center  in  the  sense  that  each  node  will  receive  estimates  from  all  other 
nodes  and  generate  an  identical  globally  optimal  estimate  of  the  state.  This  can  be 
mechanized  if  all  the  sensors  are  communicating  m  a  broadcast  mode.  As  in  the  previous 
chapters,  we  assume  that  there  is  no  appreciable  transmission  or  computational  delay. 
Therefore,  after  an  observation  is  made  in  the  discrete  time  case,  the  nodes  will  compute  the 
locally  optimal  estimate  and  transmit  it  before  the  next  observation  is  made.  Furthermore, 
we  also  assume  that  all  the  sensors  are  engaged  in  time-synchronized  measurements  For 
the  continuous  time  case,  we  are  assuming  that  computation  at  and  transmission  from  each 
node  take  place  instantaneously.  We  will  consider  that  the  SDl  target  is  governed  by  linear 
time  varying  dynamics  and  measurement  equations. 

Recently,  there  has  been  a  great  deal  of  interest  in  the  decentralized  estimation. 
Speyer  (1979)  first  formulated  the  problem  in  an  LQG  framework  -  he  solved  the 
decentralized  estimanon  as  a  prelude  to  a  decentralized  control  problem  He  assumed  that 
the  global  and  local  models  were  idenncal.  Willsky  et  al.  (1982)  solved  the  problem  for  a 
more  general  case  of  nonidentical  global  and  local  models  -  they  have  also  solved  the 
distnbuted  smoothing  problem  in  this  paper.  Recently,  Hashemipout,  Roy  and  V-amb 
(1984)  have  expanded  the  distnbuted  estimation  problem  to  include  collocated  and  non- 
collocated  sensors,  and  have  provided  implementation  details  for  the  discrete  time  case.  In 
all  of  these  works,  the  authors  have  assumed  that  the  measurement  noise  processes  are 
uncorrelated  from  sensor  to  sensor.  The  primary  purpose  of  this  chapter  is  to  develop  a 
distnbuted  estimation  algonthm  when  the  sensor  noises  are  correlated.  The  underlying 


assumption  is  that  the  local  agents  have  the  full  knowledge  of  the  global  model  and  various 
statistics  of  the  system. 

The  organization  of  this  chapter  is  as  follows.  In  Section  4.2,  the  important  results 
about  the  decentralized  estimation  for  the  case  of  uncorrela.cd  sensor  noise  are  reviewed. 
The  case  of  correlated  sensor  noise  in  the  continuous  time  domain  is  presented  in  Section 
4,3  Several  schemes  for  optimal  estimation  are  presented  in  this  section;  some  of  these 
schemes  employ  centralized  processing  algorithms.  The  last  of  these  schemes  is  truly  a 
distributed  one.  We  will  consider  the  contuiuous  time  case  only,  the  corresponding  results 
for  the  discrete  time  case  v/ill  be  analyzed  in  Phase  11.  Finally,  a  summary  and  conclusion 
is  provided  in  Section  4  4. 

4.2  A  Review  of  Distributed  Estimation  Problem  in  Presence  of 

Uncorrelated  Sensor  Noise:  Continuous  Time  Case 

In  this  section  we  assume  that  the  measurement  noises  are  uncorrelated  across  the 
sensors.  Suppose  that  there  are  J  sensors  tracking  a  single  SDl  target.  The  dynamic  model 
of  the  target  and  the  measurement  equations  are  assumed  as  follows: 


x(t)  =  A(t)x(t)  +  w(t)  ; 

t^to 

(4.1a) 

Zj(t)  =  Cj(t)x(t)  +  Vj(t)  ; 

j=  1,2,...J. 

(4.1b) 

Various  statistic'  relating  to  this  model  are: 

E  [x(to)]  =  x(tfl),  E  [(x(to)  -  x(to) )  (x(to';  -  x(to)/]  =  E(to)  ,- 
E  [w(t))  =  0,  E  [w(t)w‘(x)]  =  Q(t)5(t"T) 

E(v/t)]  =  0,  E[v/t)v^(x)]  =  R/t)5{t-x)5ji.,  R/t)  =  R]'(t)>0 


We  also  assume  that  w(t)  is  independent  of  x(to)  and  of  Vj(t)  for  all  j  =  1  ,2  It  is  also 
assumed  that  v,(t)  is  independent  of  xflp). 

Since  the  underlying  aisumpuon  is  that  the  local  processors  have  the  full  knowledge 
of  the  global  model,  the  state  model  at  the  j-th  sensor  can  be  written  as 

Xj(t)  =  A(t)x,(t)  +  w(t) ,  (4.2a) 

with 

E  [  Xj(to)]  =  Xj(to),  E  (( Xj(to)  -  Xj(to) )  (  Xj(to)  -  Xj(to)  )^]  =  £j(to) . 

The  measurement  equation  and  other  related  statistics  are  same  as  above. 

The  globally  optimal  filtered  state  conditional  on  the  measurement  history  of  all  the 
sensors  is  given  by 

x(t)  =  E  [x(t)  I  Zi(Q,  Z2i0  ...Zj(C) :  to  2  ^  S  t  ] . 

This  estimate  is  propagated  as 

I 

i(t)  =  A(t)i(t)  +  Y,  (t)  (z/t)  -  Cj(t)i(t) ) (4.3a) 

1  =  1 

v/here 

Kj“(t)  =  P(t)cJ(l)R;'(t),  (4.3b) 

and  the  esumaie  error  covariance  niamx  P(t)  is  the  solution  of  the  Riccati  ajuauon 

j 

P(t)  =  A(t)P(t)  +  P(t)A  Ht)  +  Q(t)  -  2^?(t)C]'(t)R;‘(t)Cj(t)P(t)  (4.3c) 
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p(to)  =  J:(to). 


Clearly  the  mechanization  of  (4.3)  requires  that  all  Lhe  sensor  measurements  be  transmitted 
to  a  central  node  where  x(t)  will  be  generateo  We  have  used  the  notation  to  imply 
that  this  is  the  centralized  Kalman  gain  when  the  sensor  noises  are  uncorrelated. 

If  the  nodes  are  allowed  to  process  their  own  observations  locally,  the  local  optimal 
estimate  evolves  as. 

ij(t)  =  A(t)J/t)  +  Kf  (t)  [z/t)  -  Cj(t)i/t))  (4.4a) 

where 

Kf(t)  =  P/t)C]'(t)R;'(t)  (4.4b) 

and  the  error  covartance  matrix  Pj(t)  can  be  computed  from  the  Riccati  equation 

P/t)  =  A(t)P/t)  +  Pj(t)A'^(t)  +  Q(t)  -  P/t)Cj(t)R;‘(t)Cj(t)Pj(t)  (4.4b) 

Pj(to)  =  Sj(to). 

Here,  K*(t)  stands  for  the  decentralized  or  localized  Kalman  gain  associated  with  the  j-th 
node  in  presence  of  uncorrelated  sensor  noise.  The  problem  of  decentralized  or  distributed 

A  A 

estimation  is  to  reconstruct  x(t)  in  terms  of  Xj(t),  j  =  1,  2,  ...J  of  (4.4a)  Willsky  ct  al. 

A 

1 1982)  has  shown  that  x(t)  can  be  constructed  at  a  central  node  dynamically  from  the  local 
esumates  as  follows: 

j 

C(t)  =  F(t)C(t)  +  XK,(tM/0  (4.5a) 

j  =  i 
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(4.5b) 


I 

x(t)  =  ^(t)  +  ^P(t)P“’(t)Xj(t) 
j  =  i 

where 

J 

F(t)  =  A(t)  -  ^P(t)cJ(l)R;'(t)C/t) ,  (4.5c) 

j  =  i 

K/t)  =  [P(t)p;‘(t)  -  I]Q(t)P7'(t) .  (4.5d) 

A 

These  equations  show  that  each  node  j  must  send  to  the  central  node  its  local  estimate  Xj(t) 
and  the  associated  covariance  matrix  Pj(t).  The  structure  of  the  central  node  can  be 
simplified  using  Speyer's  (1979)  form  where  some  of  the  computational  burden  of  the 
central  node  is  transferred  to  the  local  nodes.  At  each  node  j,  a  data  dependent  vector  hj(t) 
is  generated  as  follows: 

hj(t)  =  F(t)hj(t)4-Kj(t)!J/t).  (4.6a) 

Then,  at  the  central  node,  the  global  estimate  is  constructed  as 

J 

x(t)  =  (?p(t,  to)x(t(,)  +  [P(t)P7'(t)Xj(t)  +  hj(t)l . 

j  =  i 

where  <}ip(t,  Iq)  is  the  tra.nsition  matrix  associated  with  the  dynamics  matrix  F(t)  The 
complexity  of  the  central  node  has  been  reduced  at  the  expense  of  higher  transmission 
requirements  -  now  each  node  j  must  send  to  the  central  node  2  vectors  the  locally  optimal 

A 

esumate  Xj(t)  and  a  data  dependent  vector  hj(0.  It  lias  been  shown  in  Chapter  2  that  the 
transmission  requirements  can  further  be  reduced  to  only  one  data  dependent  vector. 
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The  key  to  the  distnbuted  estimation  algonthm  (4.5)  and  (4.6)  ts  the  fact  that  the 

A 

global  esdmaie  x(t)  can  be  decomposed  in  terms  of  the  locally  computed  data  dependent 
vectors  as  shown  in  these  equations  This  decomposition  has  been  possible  because  the 
sensor  noises  are  uncorrelated  It  s  the  purpose  of  the  next  section  to  show  how  this 
decomposition  can  be  achieved  when  the  sensor  noises  are  correlated  across  the  sensors 

4.3  Distributed  Estimation  with  Correlated  Sensor  Noise:  Continuous 
Time  Case 

Consider  again  the  dynamic  model  (4.1)  having  the  same  process  noise  and  initial 
condition.  But  this  time  we  assume  that  the  sensor  noises  are  correlated  Specifically  we 
assume 

E(v/t)l  =  0,  E(v/t)vI(x)]  =  Rjk(l)6(t-T)  (4.7) 

As  before,  we  assume  that  the  process  noise  w(t)  is  independent  of  xftg)  and  of  Vj(t)  for  all 
j  =  1,2,  ...J.  Also,  Vj(t)  IS  independent  of  xftp).  The  optimal  estimation  problem  is  first 
solved  for  the  centralized  case,  i.e.,  where  the  raw  measurements  from  all  the  sensors  are 
transmitted  to  a  central  node. 

A  Centralized  Solution  for  the  Optimal  Estimation  Problem 
The  opnmal  filtered  estimate  is  given  by 

x(t)  =  E  (x(t)  I  Zj(Q;  tfl^ .j  =  !•  2, ..  J  ] 

and  can  be  computed  as  follows  in  a  straightforward  way  Define 
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’zi(t)‘ 

'c,(t)' 

’V](t)’ 

6 

Z2(t) 

A 

c^t) 

V2(t) 

z(l)  = 

Zj(t) 

;  C(t)  = 

c/t) 

;  v(t)  = 

Vj(t) 

Then  all  the  sensor  measurements  can  be  compactly  represented  as 

2(0  =  C(t)x{t)  +  v(t)  (4.8b) 

Here  v(t)  is  a  white  noise  process  whose  covariance  matnx  is  given  by 

E  (v(t)v\t)]  =  R(i)S(t-t) ;  R(t)  =  R'^(t)>0  (4  9a) 

where 

R(t)  =  bloclclR,j(t);  i,j=l,2,..J)  (4.9b) 

Clearly,  for  the  case  of  uncorrelated  sensor  noise,  R(t)  is  a  block  diagonal  matrix  and 
It  IS  this  block  diagonal  structure  of  the  R(t)  matnx  that  has  led  to  the  decomposition  of  the 
global  estimates  as  in  Equations  (4.5)  -  (4.6)  For  the  correlated  sensor  noise  this  is  a 
"full"  matrix.  However,  irrespective  of  the  structure  of  this  matnx,  the  optimal  filtered 
estimate  is  given  by 

x(t)  =  E[x(t)  I  z(Q;  tfl^^St] 

which  can  be  computed  from  the  standard  Kalman  filter. 

k(t)  =  A(t)i(t)  +  K“(t)[z(t)-C(t)i(t)) ,  (4.i0a) 

K“(t)  =  P(t)C’^(t)R-‘(t),  (4.10b) 


and  the  covariance  matrix  P(t)  obeys 
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P(t)  =  A(t)P{t)  +  P(t)A'^(t)  +  Q(t)  -  P{t)d(t)R''(t)C(t)P(t)  (4.10c) 

P(to)  =  I(to). 

C(t)  and  R(t)  have  been  defined  in  (4.8a)  and  (4.9al  rcpectively.  Here  K“(t)  stands  for 
the  centralized  Kalman  gain  for  the  correlated  sensor  noise  The  scheme  of  (4  10)  can  be 
mechanized  at  a  central  node  where  all  the  sensor  measurements  Zj(t)  are  transmitted  to 
form  the  aggregate  measurement  vector  z(t) 

Alternatively,  this  vector  z(t)  can  be  transformed  into  pseudo-measurement  vector  z(t) 
such  that  the  measurement  noise  associated  with  this  new  vector  corresponding  to  various 
sensors  is  uncorrelated  To  obtain  z(t),  decompose  R(t)  in  such  a  way  that 

R(t)  =  R'®(t)R'^"(t)  (4.11a) 

where  R'''2(t)  is  a  positive  definite  square  root  of  R(t).  Then 

i(t)  =  R*''^(t)z(t)  (4.11b) 

=  C(t)x(t)  +  ;(t) 

where 

C(t)  =  R‘‘'^(t)C(t) ,  C(t)  =  R^'^^vfi) .  (4. 1  Ic) 

Clearly,  the  covariance  matrix  R(t)  of  v(t)  is  the  identity  matrix  for  all  t  and  satisfies 
therefore  the  new  measurement  noise  v(t)  is  uncorrelated  across  the  sensors.  It  can  be 
venfied  in  a  straightforward  way  that  the  Riccati  equations  associated  with  the  Kalman 
filters  for  the  original  form  (4.10)  and  the  new  measurement  coordinate  systems  (4.1 1)  are 
the  same  Therefore  the  optimal  estimate  in  the  new  measurement  system  is  given  by 
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(4.12a) 


j 

x(t)  =  A{t)x(t)  +  ^K“(t)  [ij(t)  -  Cj(t)x{t)] 
j  =  i 

Kf(t)  =  P(t)C/t)  (4.12b) 

where  P(t)  is  obtained  from  (4.10c)  and  Cj(0  is  the  j-th  submatrix  of  C(t)  corresponding  to 
the  j-th  sensor.  Similarly,  z,(t)  is  the  corresponding  j-th  subvector  of  z(t).  This  scheme 
looks  the  same  as  for  the  uncorrelated  sensor  noise  as  shown  in  (4  3),  but  yet  can  not  be 
implemented  in  a  decentralized  framework.  The  reasons  are  as  follows,  although  Cj(t)  and 
K“(t)  can  be  computed  at  the  j-th  node,  but  not  Zj(t).  In  fact  Cj(t)  and  K“(t)  can  be 
computed  and  stored  apnori.  In  order  to  compute  z,(t),  z(t)  must  be  formed  first  at  that 
node  and  therefore  must  receive  the  raw  observations  from  all  other  nodes.  Once  z(t)  is 
formed,  it  is  to  be  multiplied  by  R  ’®(t)  and  z^vt)  is  the  j-th  subvector  of  the  resulting 
vector.  Our  goal  is  to  process  the  raw  observation  data  locally  and  then  transmit  the 
processed  data  to  a  central  node. 

Distributed  Optimal  Estimation 

In  this  section  we  will  show  how  each  node  will  generate  a  data  dependent  vector 
from  us  own  observations  only  and  without  utilizing  data  from  other  sensors.  These 
vectors  from  all  the  nodes  will  be  communicated  to  a  mixing  node  where  these  will  be 
fused  into  the  globally  optimal  estimate.  The  main  result  is  presented  in  the  following  fact. 

Fact  1 

Suppose  at  each  node  j,  a  data  dependent  vector  qj(t)  is  generated  as  follows: 

qj(t)  =  F(t)qj(t)  +  K“(t)z/t);  qj(to)  =  0.  (4.13a) 
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where 


F(t)  =  A(t)-K“(t)C(t)  (4.13b) 

and  Kj '(t)  is  the  j-th  submatiix  from  a  compatible  decomposition  of  K“(t)  of  the  form 

K“(t)  =  [Kf(t).Kf(t)  ..KfW]  (4.13c) 


Then 

J 

x(t)  =  <^F(t,  to)x(to)  +  S'!/')  (4.13d) 

1  =  1 

Here  K<i<^(t)  is  the  centralized  Kalman  gain  given  in  (4.10b)  and  C(t)  is  the 
measurement  matnx  associated  with  the  aggregate  measurement  vector  z(t)  as  shown  in 
(4.8);  K“(t)  is  the  appropriate  block  of  K“(t)  corresponding  to  thej-th  sensor  and  ki) 
is  the  transition  matrix  associated  with  F(t). 

This  ts  a  simple  yet  remarkable  result.  Since  the  underlying  assumption  is  that  the 
local  agents  have  the  full  knowledge  of  the  global  model,  all  the  variables  of  (4.13b)  are 
available  to  the  j-th  processor.  K“(t)  can  be  locally  computed  or  can  be  transmitted  from  a 
central  coordinator  -  in  either  case  it  can  be  stored  aprion  at  the  j-th  sensor  The  same 
comment  applies  to  the  construction  of  F(t).  Moreover,  each  local  processor  can  be 
initialized  to  zero  thus  avoiding  the  requirement  of  a  complicated  initial  setting.  Therefore 
qj(t)  IS  truly  a  node  dependent  vector  and  can  be  constructed  from  the  observations  at  that 
node  only. 

An  implementation  of  this  scheme  is  shown  in  Figure  4.1.  It  can  be  seen  from  this 
figure  that,  the  node  j  docs  not  have  an  implicit  structure  of  a  Kalman  filter,  neither  the 
central  node  has  a  Kalman  filter  in  it.  Each  node  sends  only  one  data  dependent  vector  qj(t) 
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to  the  central  node  -  thus  the  transmission  requirement  is  minimal  The  structure  of  the 
central  node  is  also  quite  simple;  if  xCtg)  =  0,  it  is  merely  an  adder. 


Proof 


A 

The  proof  is  straightforward.  We  will  show  that  x(t)  generated  this  way  i.ideed 
satisfies  the  standard  Kalman  filter  equations  given  in  (4  10)  Clearly,  since  qj(to)  =  0, 


qj(t)  =  I  ()ip(t.  T)K“(x)Zj{x)dt 


(4.14a) 


Substituting  qj(t)  in  (4.13d),  we  get 

J  ^ 

x(t)  =  <i)F(t,  to)x(to)  +  X  1  't)K“(x)Zj(x)dx  .  (4  14b) 

,  =  i  'o 

Differentiating  both  sides, 

j 

x(t)  =  F(t)x(t)  +  X  Kf  (t)zj(t) .  (4.14c) 

J  =  1 

Finally,  rearrangement  of  this  equation  gives 

kt)  =  A(t)i(t)  +  K"(t)  [z(t)  -  C(t)x(t)l  (4. 1 5a) 

which  is  the  same  as  (4.10).  ■ 

There  are  several  interpretations  of  qj(t).  For  the  case  of  uncorrelated  sensor  noise,  it 
is  the  locally  optimal  estimate.  When  the  sensor  noises  are  correlated,  qj(t)  can  be  regarded 
as  a  sub-optimal  estimate  of  the  global  state  as  viewed  by  the  local  agent  This  vector  will 
be  studied  in  detail  in  Phase  II. 


4.4  Conclusions 

The  deceniialiEed  estimation  problem  for  the  correlated  sensor  noise  has  been  solved 
in  this  chapter.  The  underlymg  idea  of  the  decentralized  estimation  is  that  the  observations 
are  processed  locally  by  each  node  and  then  the  processed  data  is  sent  to  a  central  node 
where  the  globally  opumal  estimate  is  constructed  The  problem  with  uncotrelated  sensor 
noise  has  been  reviewed  first  in  Section  4.2,  because  most  of  these  results  were  used  to 
derive  conesponding  results  for  the  correlated  sensor  noise  case.  It  has  been  shown  in 
Section  4.3  that  for  the  later  case,  each  node  must  generate  a  data  dependent  vector  qj(t) 
which  IS  then  transmitted  to  the  central  node  and  fused  with  similar  vectors  from  other 
nodes  into  the  globally  optimal  estimate.  The  detailed  properties  of  this  vector  are  yet  to  be 
investigated. 

We  have  solved  this  problem  for  a  special  case  of  identical  global  and  local  models 
When  the  models  differ,  the  analysis  is  very  difficult  and  will  be  taken  up  in  the 
forthcoming  Phase  H. 


central  node 

Figure  4.1;  Distributed  Estimation  with  Correlated  Sensor  Noise 


CHAPTER  5 


DEVELOPMENT  AND  APPLICATION  OF  DECENTRALIZED  MODIFIED 
GAIN  EXTENDED  KALMAN  FILTER  TO  SDI  TRACKING 

5.1  Introduction 

In  this  chapter,  the  results  of  decentralized  estimation  of  the  previous  chapters  are 
extended  to  a  special  class  of  nonlinear  measurements  encountered  in  the  SDI  tracking 
problem  The  IR  sensors  used  m  the  SDI  system  typically  provide  information  on  bearings 
of  the  targets  Rtdar  and  Laser  trackers  can  provide  additional  information  on  range  and 
range-rate  If  the  motion  of  the  target  is  descnbed  in  a  rectangular  coordinate  system,  the 
above  measurements  are  related  nonlinearly  to  the  state  vanables  of  the  target 

The  approach  used  to  extend  the  results  of  the  linear  decentralized  Kalman  Filter  to 
the  above  case  is  to  use  Modified  Gain  Extended  Kalman  Filters. 

In  this  chapter,  a  new  nonlinear  filter,  whose  structure  is  similar  to  that  of  the 
Extended  Kalman  Filter  (EKF),  is  applied  to  the  estimation  problem  using  a  strapdown 
seeker.  This  nonlinear  filter  is  based  upon  the  system  nonlineanties  being  members  of  a 
special  class  of  functio-  ailed  modifiable.  The  essential  idea  is  that  the  difference 
between  the  nonlinear  function  at  an  unknown  state  and  a  known  state  is  equal  to  a  linear 
function  in  the  difference  between  the  unknown  and  known  state  operated  upon  by  a  matrix 
function  composed  only  of  the  known  state  and  the  measurement  functions.  Thus,  this 
special  class  possesses  a  natural  linearity.  Although  this  class  is  quite  small,  it  does  include 
several  of  the  nonlineanties  present  in  the  SDI  tracking  problem.  The  theoretical  analysis 
of  centralized  estimators  containing  this  type  of  nonhneanty  with  interesting  applications 
are  given  in  Song  and  Speyer  (1984),  Safonov  and  Athans  (1978)  The  theoretical  results 
show  that  if  no  noise  inputs  are  present,  the  filter,  acting  as  an  observer,  is  globally  stable 
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If  noise  is  present,  the  centralized  filter  is  shown  with  additional  conditions  to  be 
stochasDcally  exponennally  bounded  The  convergence  of  the  estimates  to  unbiased  values 
has  been  indicated  mostly  by  simulation  (Song  and  Speyer,,  1984)  but  for  the  parameter 
identification  problem  can  be  shown  analytically  (Safonov  and  Athans,  1978) 

Since  the  class  of  nonlinear  functions  that  are  modifiable  may  not  include  all  the 
nonlineanties  present  in  a  state  estimation  problem,  a  filter  structure  involving  both 
modifiable  and  nonmodifiable  functions  is  possible  for  sophisticated  applications  such  as 
SDl  Consequently,  those  nonlinearities  which  are  modifiable  are  included  in  the 
estimation  algorithm  in  their  modifiable  form  and  those  that  are  not  modifiable  are  included 
as  they  would  be  in  a  standard  EKF.  In  the  SDi  tracking  problem,  the  essential  or  most 
important  nonlineanties  belong  to  the  class  of  modifiable  nonlineanties.  The  essential, 
nonlineanties  are  the  functional  forms  of  the  elevation  and  azimuth  angles,  the  range,  and 
the  range  rate  formulated  in  rectangular  coordinates  The  misalignment  errors  produce 
nonlineanties  which  are  included  by  the  lineanzation  method  of  the  standard  EKF.  The 
resulting  filter  is  called  the  modified  gam  EKF  (MGEKF),  since  only  the  gam  calculahons 
are  somewhat  different  from  those  of  the  EKF  but  not  the  estimator  structure 

If  both  angles  and  range  and  range  rate  are  measured,  then  the  anticipated  advantages 
of  the  MGEKF  over  the  EKF  may  not  be  clearly  seen  since  in  both  schemes  the  large 
number  of  measurements  allows  good  estimation  of  the  states  However,  if  only  angle 
information  is  present  which  is  the  case  with  IR  measurements,  then  it  is  anticipated  that 
sign’fi  -ant  improvement  m  filter  response  and  stability  should  be  evident  m  the  MGEKF 
formulation 
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5.2  Modifiable  Functions  in  the  SDI  Trackinf'  Problem 

The  measurements  considered  here  are  the  elevation  and  azimuth  angles,  range  and 
range  rate  or  any  subset  of  these  measurements.  These  measurements  are  put  in  modifiable 
form  first.  Then  tlie  modification  to  the  gain  calculanon  is  given. 

Modifiable  Form  of  Measurements 


Let 


A 
X  = 


X' 

Y 

Z 


(5.1) 


denote  the  SDi  target  posmon  in  inemat  coordinate  frame  and 


A 

[Xb 

'X' 

X  = 

Yb 

=  T 

Y 

-B 

Zb 

Z 

(5.2) 


denote  the  target  posmon  in  a  body  axis  system  centered  at  the  sensored  platform  wheie  T 
denotes  the  transformanon  matrix  between  the  two  coordinate  frames. 

The  measurements  are  given  in  detemamsuc  form  as  angle  measurement  functions: 


Ey 

il 

Ky 

^yz 

Xyz 

Kz 

tan  (Yg/Xg) 


tan  ( 


-Zr 


V  x^+y| 

where  Qf  =  [Gy,  Gz]'''is  the  glint  noise  state  vector 


(5.3) 


range  function:  R  =  Vx^+Y^Z^  =  Vxg  +  Yg  +  Zg  ,  (5-4) 
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I 

I 

I 

a 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

i 

I 

I 


The  actual  measurement  is  given  by  z  =  z*  +  v  where  v  is  white  zero  mean  Gaussian 
noise  The  global  exponential  boundedness  for  the  stochastic  case  are  obtained  by 
assuming  that  cenain  finite  gam  operators  associated  with  the  additive  measurement  noise 
do  not  destabilize  the  system  Results  of  this  son  are  given  in  Song  and  Speyer  (1985) 
with  respect  to  a  constant  gain  EKF.  The  results  of  this  chapter  extend  these  results  to  time 
varying  gam. 

5.3  The  Definition  of  a  Modifiable  Function 
The  measurement  function  is  modifiable  if 

h(X,a,Y.K)-h(X.V,G,K)  =  F(z*.X,(i,V,K)  (5.7) 

K-K 

where  X..  Q.  Y,  K  are  unknown  values  but  X,  V,  K  are  estimated  values  of  the  state 
and  are  thereby  known.  Note  ihat  F  w'dl  be  evaluated  using  z  rather  than  z*  in 
implemennng  the  filter.  Funhermore,  note  that  for  continuous  functions 
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(5.8) 


F(z*,2(,y,G) 


ah 

9(X..Q..y,ID 


(X.G.,y.K)  =  (K,y.£i.K) 


Therefore,  in  the  limit  the  MGEKF  converges  to  the  EKF,  as 
(X,  Q.  y)  =»  (X,  y.  Q)  This  should  be  used  as  a  check  on  the  numerical 
implementanon 


5.4  List  of  Preliminary  Functions 


The  following  funcnons  are  required  in  implementing  the  MGEKF. 


az  =  tan 

Xfi 


-r  ,  -1  -Zb 
el  =  tan 


Vxb^Yb^ 


H(az,  el)  = 


sin  az,  -cos  az. 


0 


sin  el  cos  az,  sin  el  sin  az,  cos  el 


(5  9) 


(5.10) 


(5.11) 


E  = 


Ian  a 


(cos  az  Xb+  sin  az  Ys 


Ian  P 


_2  _2  '2  _ 
1cosc1(Xb+Y^  -sincclZzjP 


(5.12) 


cos  az  Xb+  sin  az  Yb 


_2  _2  '2 

COS  cl(X  b  +  Y^  -sinclZB 


H2fB 


'u 

II 

tY 

lelj 

cz 

-  k'‘g 


(5.13) 


(5.14)’ 
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]  =  r' 

j 

cos  az  -  cos  az 


az* 

az 

sm  az  - 

sin  az 

az- 

az 

cos  el  - 

cos  el 

el  - 

d 

sin  el  - 

sin  el 

el  -  el 


W  , - 

cos  az  =  7  ^ 

/VXb+Yb 

sm  az  =  ^7  ^ 

/  VXb+Yb 


-T  Zb 
cos  el  =  -=- 
R 


^  Vxb+Yb 


sin  el  =  - 


‘B 

R 


R  =  Vx 

Fa=-KEHT' 

P  _  az,  el,  0 
~  0,  az,  el 


(5.14)  ‘ 
(5  15)  t 

(5.16)  + 

(5.17)  t 

(5.18)  t 

(5.19) 

(5.20) 

(5.21) 


(5.22) 

(5.23) 

(5.24) 

(5.25) 


'  Note  Cy,  E^.  and  R  are  replaced  by  tficir  associated  noisy  measuremenis  when  the  MGEKF  is 
implemented 

^  If  (5.15)  to  5  20)  have  numerical  difficulty  due  to  division  by  a  small  number  then  approximate  as 
a,  =  sinO,  aj  =  cos9.  e|=sin$.  c^^cosd  where  0=(I/2)(a2+»z)  and  ^(l/2)(el+et) 
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(5.26) 


Fs= 


(cos  az  cos  el,  sin  az  cos  el,  -sin  el)  -  (Xg  cos  el  aj  |  -i 


+  Yg  cos  el  a2,  Xg  cos  az  e,  +  Y  sin  az  63  -  Z  EH  j 


F  gv  =  (cos  el  cos  az,  cos  el  sin  az,  -sin  el)  T’ 


RR- 


/(Vx 

Vy 

Vz\ 

/xVx+YVy+ZVz^^  \ 

\U 

R 

R/ 

1  RR  I 

(5.27) 


(5.28) 


5.5  The  Modifiable  Forms  for  the  Measurements 


The  measurement  function  given  by  Eq.  (5.6)  is  only  approximately  modifiable.  The 
approximate  modifiable  form  given  by  Eq  (5.7)  is  presented  below  The  angular 
measurements  become  approximately  modifiable  as 


Ex. 

y 

y 

L  yJ 

L  yJ 

=  Fa(x-x)  +  2-G  +  Fm(K-K) 


(5.29) 


where 


K  = 


Ky 

Kyz 

K7 


(5  30) 


The  range  measurement  becomes  approximately  modifiable  as 
R-RsFr(X-X). 


(5  31) 


The  range  rate  measurement  becomes  approximately  modifiable  as 

R-^=FRg(X-S)  +  FRv{Y-V). 


(5.32) 


Define  an  error  vector  as  (note  the  acceleration  error  is  not  included  but  can  be  done 
trivially) 
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(5.33) 


e  =  (X-X,y-V,G-G.K-K)'^ 

Then,  the  modifiable  form  is  given  by  Eq.  (5.7)  where 

F  =  F(z*,X,V.G.K) 

Fa  0  1  Fm 

=  Fa  0  0  0  (5.34) 

Frr  Frv  0  0 

and  where  the  elements  in  the  matrix  are  given  in  section  4. 

5.6  Change  in  the  Update  Equation  for  Gain  Calculation  in  the  MGEKF 

The  update  formulas  for  the  EKF  and  the  gam  calculation  are  the  same  for  the 
MGEKF,  except  that  the  update  equation  for  the  error  vanance  equation  when  processing  a 
measurement  vector  is  changed  from 

P  =  (I-K^,)M(I-K(h,)  +  KfVKf  ,  (5.35) 

where  Kf  is  the  EKF  gain,  M  is  the  a  pnon  "error  variance"  before  a  measurement  is 
processed,  and  P  is  the  postenon  "error  vanance,"  to 

P  =  (1-KmF)M(I-KmF)  +  KmVkI,  (5.36) 

where  K^j  is  the  gam  of  the  MGEKF  Therefore,  the  only  change  in  the  estimation 
algonthms  is  that  h,  is  replaced  by  F.  Note  that  F  =  f(z,  X.  G.  K).  i-C-.  F  is  calculated 
using  z  rather  than  z*. 

We  now  extend  the  above  results  to  the  case  of  distnbuted  filtenng  for  the  SDI 
tracking  problem.  The  structure  of  the  MGEKF  filter  is  similar  to  the  KF  for  the  linear 
case. 
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5.7  The  Local  MGEKF 


Suppose  that  tlie  suite  of  nonlinear  measurements  for  sensor  fusion  are 

Zj(k)  =  gj(x(k))  +  Vj(k)  N(k)  (5.37) 

wliere  the  noiseless  measurement  is  Zj*  =  gj(x)  and  N(k)  is  the  number  of  sensors  that  are 
to  be  combined  at  time  stage  k.  The  number  may  vary  depending  upon  the  purity  of  the 
current  local  estimators. 

It  IS  assumed  that  all  the  measurement  functions  are  modifiable  functions  or  at  least 
approximately  modifiable.  Thus, 

gj(x)  -  gj(x)  =  H(Zj*,  x)  (x  -  x)  (5..38) 

let  us  consider  what  this  means  for  the  local  esnmator  equation 

Xj(k)  =  Xj(k)  +  Kj(k)  (z,(k)  -  gj(Xj) )  .  (5.39) 

Xj(k)  =  Xj(k  I  k-1) 

where  Kj(k)  is  the  gam  of  the  MGEKF.  Note  that  the  estima.or  can  be  represented  in  terms 
of  the  modifiable  function  as 

Xj(k)  =  x/k)  +  Kj(k)H(Zj*,  Xj)  (x(k)  -  i/k) )  +  K/k)v/k)  (5.40) 

a  .  .  a 

Therefore,  the  error  equations  for  Cjfk)  =  x(k)  -  Xj(k)  and  ej(k)  =  x(k)  -  Xj(k)  are 

e/k)  =  [I  -  Kj(k)H(Zj»,  Xj)l  e/k)  -  Kj(k)vj(k)  (5.41 ) 

ej(k+l )  =  A(k)ej(k)  +  w(k)  (5.42) 

where  (5.42)  is  obtained  directly  from  the  dynamic  state  equation.  Note  that  without  any 
approximation,  assuming  modifiable  nonlineanties,  the  error  equations  (5.41)  and  (5.42) 
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are  linear,  even  in  the  presence  of  noise.  The  essential  improvement  over  the  EKF  is 
making  use  of  this  observation  in  constructing  the  gam  algorithm  for  Xj(k). 

In  constructing  the  gam  algonthm  one  consideration  was  that  the  esnmates  should  not 
be  biased  It  is  shown  in  Song  and  Speyer  (1984)  that  if  the  gain  is  a  function  of  the 
present  measurement,  then  the  estimator  is  highly  biased.  To  avoid  this,  the  gam  was 
assumed  using  Hj(k)  A  Hj(gj(Xj(k)),  Xj(k))  to  be  of  the  form 

Kj(k)  =  Mj(k)flj(k)  [n,(k)'^Mj(k)H/k)  +  Rj(k)  (5.43) 

where  we  have  used  the  notation  Mj(k)  =  P.(k  I  k-l).ln  this  way,  Kj(k)  is  only  a  function 
of  past  measurements.  It  should  be  noted  that  Hj(k)  is  essentially  the  partial  of  gj  with 
respect  to  x  evaluated  at  x  =  Xj(k).  In  this  way,  the  algonthm  is  still  that  of  the  EKF  The 
difference  anses  in  how  the  mamces  Pj  and  Mj  are  propagated.  These  will  be  referred  to  as 
postenon  and  a  pnon  error  vanances,  but  it  must  be  remembered  that  these  are  not  actually 
statisncs  and  are  really  a  kind  of  quasi-vanance  This  statement  also  applies  to  the  so-called 
statistical  properties  of  the  EKF.  The  error  variances  are  propagated  as 

Pj(k)  =  [I  -  KjH/z,,  x,)]Mj(k)(I  -  K.Hj(Zj,  Xj)]^  +  Kj(k)Rj(k)K/k)’''  (5.44) 

M/k+1)  =  A(k)Pj(k)A(k)'^  +  Q(k)  (5.45) 

where  we  have  used  the  notation  Pj(k)  =  Pj(k  I  k)  and  Mj(k-l-l)  =  Pj(k+1  1  k).  Note  that 
Hj(Zj,  Xj)  is  used  in  the  update  of  Pj(k)  where  the  actual  measurement  is  used. 
Furthermore,  in  the  absence  of  measurement  noise  Pj(k)  reflects  the  actual  error  (5.41) 
whereas  in  the  EKF  the  associated  error  vanance  does  not  reflect  the  actual  error  (5  41) 
hinally,  rt  is  shown  in  Song  and  Speyer  (1984)  that  if  <.*  is  used  rather  than  Zy  then  under 
reasonable  assumptions  the  error  is  exponentially  bounded  in  mean  square. 


5.8  The  Global  MGEKF 


The  local  estimates  are  to  be  combined  by  the  memoryless  formula  given  in  earlier 
chapters.  Since  the  actual  error  is  reflected  by  (5.44),  it  seems  reasonable  to  combine  the 
estimates  according  to  the  formula  for  the  linear  case  where  the  effect  of  the  error  size  is 
reflected  by  Pj(k).  At  present  we  have  no  bounds  on  the  quality  of  Pj(k)  in  reflecting  the 
true  error  variance,  but  determining  these  bounds  is  a  subject  of  additional  research.  Since 
the  EKF  IS  based  upon  a  local  linearization,  there  is  no  reason  to  believe  that  the  pseudo¬ 
error  vanance  reflects  the  actual  error  variance  unless  the  actual  error  is  small.  The 
quesdon  arises  as  to  what  estimate  is  best  to  perform  the  required  local  linearization.  Since 
the  MGEKF  is  based  on  a  universal  linearization  and  not  a  local  linearization,  this 
restriction  does  not  apply. 

In  developing  a  global  estimate,  both  P(k)  and  hj(k)  need  to  be  computed.  The 
calculation  of  P(k)  should  be  simply  as 
NCk) 

P(k)''  =  M(k)'‘  +  ^  H/z.,  x/  R/k)'’H(Zj,  Xj)  (5,46) 

1=1 

where  M(k)  is  propagated  using  the  same  propagation  formula  as  for  the  linear  case. 
However,  it  is  suggested  that  during  periods  where  certain  sensors  are  in  doubt  that  this  be 
calculated  with  respect  to  each  of  the  suspect  sensors  eliminating  the  other  suspect  sensors 
For  example,  suppose  that  the  IR  sensor  is  affected  by  a  flare  and  two  signatures  are  being 
tracked  It  is  suggested  that  (5.46)  be  calculated  for  the  three  situations  each  of  the  two 
tracks  on  the  IR  sensor  and  without  the  IR  sensor  If  the  IR  sensor  is  eliminated,  the 
glyb,il  caiimales  ai\.  uiKufected  by  the  countermeasure.  However,  when  one  of  the  two  IR 
tracks  arc  identified  as  the  flare,  the  information  from  the  other  IR  track  is  now  known  to 
be  correct  signal,  and  this  information  is  used  to  obtain  a  better  estimate  than  when  the  IR 


sensor  is  eliminated.  The  difficulty  expressed  here  is  due  to  the  fact  that  the  error  variances 
cannot  be  computed  off-line  for  the  nonlinear  estimation  problem. 

Once  P(k)  is  obtained  from  (5.46)  then  hj(k)  of  (5.43)  can  be  computed.  Note  that 
both  F(k)  and  Gj(k)  require  the  calculation  of  the  global  covanances  P(k)  and  M(k).  It  is 
again  suggested  that  dunng  periods  when  anomalies  are  present  that  hj(k)  be  calculated  for 
each  hypothesis  and  used  only  when  the  anomaly  is  known  not  to  be  present  in  the  sensor 
data  as  determined  by  the  detection  and  isolation  scheme. 
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CHAPTER  6 

SUMMARY  AND  FUTURE  RECOMMENDATIONS 
6.1  Summarv  and  Findings 

The  main  goal  of  this  project  is  to  solve  the  decentralized  estimation  problem  in 
presence  of  the  correlated  sensor  noise  in  a  multisensor  environment.  This  is  a  three  phase 
project:  Phase  I,  Phase  II,  and  Phase  III.  In  Phase  I,  the  feasibility  of  the  underlying 
techniques  will  be  established.  In  Phase  II,  the  the  problem  will  be  formulated  in  a  more 
general  setting  and  the  techniques  will  be  developed  in  details  followed  by  a 
commcrcializanon  plan  in  Phase  III.  The  work  reponed  here  is  the  outcome  of  the  Phase  I 
effon. 

The  subject  problem  is  a  very  significant  one  in  an  SDI  framework.  Typically  an 
SDI  target  is  tracked  by  one  or  more  sensors  located  on  different  orbiting  satellites.  The 
problem  is  so  estimate  the  target  state  from  all  of  these  measurements  which  can  be 
computed  either  using  centralized  or  decentralized  estimation  techniques  In  a  centralized 
technique,  all  the  sensor  measurements  are  transmitted  to  a  central  node  and  the  state  is 
estimated  by  utilizing  all  the  information  simultaneously.  On  the  other  hand,  in  a 
decentralized  framework,  the  nodes  process  the  observed  data  locally  and  then  transmits  the 
processed  data  to  the  central  node  where  these  are  fused  into  the  globally  optimum  estimate 
The  decentralized  estimation  is  superior  to  a  centralized  one  from  system  survivability  and 
computational  considerations. 

In  a  realistic  situation,  the  sensor  noises  are  correlated  because  these  sensors  are 
observing  the  same  target  through  the  same  atmospheric  medium.  In  this  case,  the 
decentralized  estimation  is  more  difficult  than  the  case  of  the  uncorrelated  sensor  noise. 
There  has  lately  emerged  a  considerable  amount  of  literature  in  the  area  of  decentralized 


71 


estimation,  but  none  of  these  has  addressed  the  problem  with  the  correlated  sensor  noise. 
In  Phase  1,  we  have  addressed  this  issue  and  found  a  solution  of  a  simpler  version  of  the 
problem  under  the  assumption  that  the  local  processors  have  the  full  knowledge  of  the 
global  model  and  the  system  uncertainnes  The  mor.  general  version  of  the  problem  with 
non-identical  global  ano  local  models,  will  be  analyzed  in  Phase  II.  A  bnef  summary  of 
this  report  follows. 

An  up  to  date  available  results  in  the  area  of  decentralized  estimation  with  uncorrelated 
sensor  noise  have  been  reviewed  in  Chapters  2  and  3  Chapter  2  is  devoted  to  the 
continuous  time  case  and  Chapter  3  to  the  discrete  time  case  These  reviews  have  put  the 
problem  with  the  correlated  sensor  noise  in  the  right  perspective  and  exposed  the  difficulty 
m  solving  It.  VVe  have  made  some  non-tnvial  extensions  of  the  available  results  which 
have  also  been  incorporated  into  these  chapters 

The  general  problem  of  decentralized  estimation  with  non-identical  global  and  local 
models  has  been  presented  in  Chapter  2  We  have  followed  the  works  of  Willsky  et  al 
(1982j  in  developing  this  chapter.  The  main  result  is  given  in  Equation  (2.14)  and  is 
shown  by  block  diagram  of  Figure  2.3.  This  shows  that  the  globally  optimal  estimate  can 
be  constructed  at  a  central  node  dynamically  from  the  local  estimates  But  the  structure  of 
the  central  node  in  this  scheme  is  complicated.  We  have  reduced  this  complexity  in  the 
scheme  of  Equation  (2.15)  and  the  corresponding  block  diagram  of  Figure  2.4.  In  this 
scheme  each  node  must  send  2  vectors  -  an  optimal  estimate  of  the  state  and  another  data 
dependent  vector,  to  the  central  node.  The  structure  of  the  central  node  as  well  as  the 
transmission  requirement  has  been  reduced  further  in  the  scheme  of  Equation  (2.16)  and 
Figure  2.5.  In  this  scheme  some  computational  burden  has  been  transferred  from  the 
central  node  to  the  local  nodes  and,  as  a  result,  each  local  node  needs  to  send  only  one  data 
dependent  vector  to  the  central  node.  However,  in  the  process,  the  complexity  of  the  local 
nodes  increases.  To  remove  this  drawback,  an  improved  scheme  is  presented  in  Equations 


(2  17)  and  Figure  2.6.  In  this  final  scheme,  there  is  no  need  of  constructing  an  optimal 
filter  explicitly  at  each  node  and  the  dynamical  equation  at  that  node  can  be  initialized  to 
zero  This  last  scheme  plays  an  important  role  in  the  development  of  the  corresponding 
results  for  the  case  of  correlated  sensor  noise  in  Chapter  4. 

Chapter  3  is  the  mirror  image  of  Chapter  2  but  for  the  discrete  time  case.  This  chapter 
IS  included  to  expose  some  implementation  details  which  are  distinct  from  the  continuous 
time  case.  The  major  results  of  this  chapter  are  due  to  Speyer  (1979)  who  solved  the 
problem  with  uncorrelated  sensor  noise  and  identical  global  and  local  models.  We  have 
extended  this  result  to  include  the  case  of  dissimilar  local  and  global  models.  The  main 
result  IS  given  in  Equations  (3. 15)  and  the  corresponding  block  diagram  of  Figure  3.3  As 
in  the  continuous  time  case,  each  node  must  send  2  vectors  to  the  central  node.  The 
transmission  requirement  and  the  complexity  of  the  global  and  local  nodes  are  reduced  in 
the  scheme  of  Equations  (3.17)  and  Figure  34  In  this  scheme,  there  is  no  explicit  Kalman 
filter  at  any  node  and  each  node  needs  to  send  only  one  data  dependent  vector  to  the  central 
node. 

The  subject  issue  fo  this  project,  i.c.  the  decentralized  estimation  problem  with 
correlated  sensor  noise  is  presented  in  Chapter  4  under  the  assumption  that  the  local 
processors  have  the  full  knowledge  of  the  global  model  Because  of  the  importance  of  this 
problem  we  have  made  this  a  self  contained  chapter.  A  brief  review  of  the  case  of 
uncorrelated  sensor  noise  is  given  in  Section  4.2  The  correlated  sensor  noise  is  dealt  with 
in  Section  4.3.  This  problem  is  first  solved  by  transmitting  all  the  sensor  data  to  a  central 
node  and  the  results  are  given  in  Equation  (4  10)  Next,  the  technique  of  decorreiating  the 
measurements  from  sensor  to  sensor  is  demonstrated  in  Equation  (4  12)  In  this  technique, 
the  observed  data  is  transformed  into  "new  measurements"  so  that  the  data  is  uncorrelated 
across  the  sensors,  but  the  solution  depends  upon  the  simultaneous  availability  of  all  the 
data  The  distributed  algorithm  is  presented  in  Equation  (4.13)  and  Figure  4.1.  This  is  a 
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remarkably  simple  algorithm  There  is  no  explicit  need  of  constructing  a  Kalman  filter  at 
the  nodes  and  the  initial  conditions  at  the  local  nodes  can  be  set  to  zero.  Each  local  node 
must  send  to  the  global  node  only  one  data  dependent  vector  the  property  of  which  is  yet  to 
be  found  out. 

In  many  situations,  panicularly  if  the  variables  are  expressed  in  the  cartesian 
coordinates,  the  dynamics  equations  for  the  target  state  is  linear  but  the  measurement 
equations  are  nonlinear  This  class  of  problem  is  usually  solved  by  using  extended  Kalman 
filters  (EKF).  In  most  of  the  cases,  the  implementation  is  based  upon  ad  hoc  expansions 
and  lineanzation  techniques.  In  this  project,  we  have  dealt  with  a  special  class  of  nonlinear 
measurements  encountered  in  an  SDI  tracking  problem  known  as  modifiable  nonlinearines 
The  resulting  filter  is  called  the  modifiable  gain  EKF  (MGEKF)  and  has  been  dealt  with  in 
Chapter  5.  In  the  case  of  beanngs  only  measurements,  it  is  anticipated  that  significant 
improvement  in  filter  response  and  stability  will  result  in  the  MGEKF  formulation  in 
contrast  to  that  of  the  EKF.  Since  this  is  a  new  idea,  this  chapter  has  been  wntten  in  a 
tutonal  fonn  The  main  result  is  given  in  Section  5.6  -  5.7.  The  decentralized  version  of 
the  results  are  presented  in  Equations  (5.38)  -  (5.45).  Tliese  are  the  preliminary  results  and 
will  be  analyzed  in  detail  in  Phase  11. 

6.2  Future  Recommendations 

It  IS  obvious  from  the  earlier  pans  of  this  repon  that  the  decentralized  estimation 
problem  in  presence  of  correlated  sensor  noise  is  indeed  a  very  difficult  one.  We  have 
solved  a  very  simple  version  of  this  problem  which  assumes  linear  dynamics  and 
measurement  equations,  identical  global  and  local  models,  no  computational  and 
transmission  delay,  all  nodes  in  broadcast  mode  etc.  Therefore  much  work  remains  to  be 
done  which  v.'ill  form  the  basis  of  Phase  11.  The  future  recommendations  include  but  are 
not  necessarily  limited  to  the  following. 
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1 .  In  a  realistic  environment,  the  local  processors  may  not  possibly  have  the 
complete  knowledge  of  the  global  model  Therefore  the  decentralized 
estimation  problem  in  presence  of  the  correlated  sensor  noise  must  be 
formulated  for  the  case  of  dissimilar  glo''al  a  id  local  models. 

2.  In  Chapter  4,  we  have  solved  this  problem  only  for  the  continuous  time  case. 
This  analysis  needs  to  be  extended  to  the  discrete  time  case 

3.  The  IR  sensors  used  in  the  SDl  system  typically  provide  information  on 
bearings  of  the  targets.  Radar  and  laser  trackers  can  provide  additional 
information  on  range  and  range-rate  In  any  event,  if  the  motion  of  the  target  is 
described  in  a  rectangular  coordinate  system,  the  above  measurements  are 
related  nonlinearly  to  the  state  variables  of  the  target.  We  have  proposed  in 
Chapter  5  that  bearings  only  measurements  can  best  be  handled  in  a  newly 
introduced  framework  called  "modifiable  functions."  This  is  a  very  powerful 
technique  and  holds  a  great  potential  in  solving  the  decentralized  estimation 
problem  with  nonlinear  measurements.  So  far  the  results  are  available  for 
centralized  estimation  only  and  we  have  indicated  in  Chapter  5  how  to  proceed 
for  the  decentralized  estimation.  Tnis  problem  needs  to  be  formulated  and 
analyzed  in  detail. 

4  In  a  realistic  situation,  there  is  a  delay  in  transmission  from  one  node  to  the 
other  and  there  is  also  a  delay  associated  with  the  computation.  In  Phase  I,  we 
have  assumed  these  delays  to  be  zero.  Time  delays  due  to  computation  and 
communication  processes  must  be  incorporated  in  the  decentralized  estimation 
problem. 
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5.  All  the  results  of  this  repon  have  been  derived  for  one  target  onl>.  This 
scenano  needs  to  be  broadened  to  multiple  targets  in  future  work 

6  We  have  assumed  that  all  the  sensors  are  at  fixed  positions  in  an  inertial  frame 
while  the  target  is  moving  with  respect  to  this  fran.e.  But  in  an  actual  SDI 
scenano,  the  sensors  are  located  in  orbiting  satellites  and  therefore  have  their 
own  dynamics.  Sensor  dynamics  must  be  included  in  the  future  formulation  of 
the  decentralized  cstimanon  problem. 

7  We  have  assumed  that  the  satellites  carrying  the  sensors  are  communicating  in  a 
broadcast  mode.  But  in  practice,  it  :s  more  likely  that  direct  communications 
will  take  place  only  between  adjacent  satellites  arranged  in  a  nng  semeture  about 
the  Earth  and  communications  will  be  allowed  in  either  direction.  The 
decentralized  estimation  problem  needs  to  be  formulated  by  incorporating  the 
communication  protocol  of  a  nng  structure 

8.  We  have  argued  while  establishing  the  superionty  of  the  decentralized 
estimation  over  the  centralized  one  that  loss  of  the  centralized  processor  could 
result  in  total  or  at  least  senous  degradation  of  U’acking  capability.  It  will  most 
likely  turn  out  that  distnbuted  estimation  will  result  in  a  more  survivable  and 
less  complex  tracking  system  It  needs  to  be  investigated  how  the  system 
penormance  degrades  in  the  event  of  one  or  more  sensor  failure.  Two  types  of 
sensor  failure  must  be  considered,  soft  (degraded)  failure  and  hard  failure 
Soft  failures  can  be  modelled  via  use  of  statistical  measurement  noise  models, 
while  hard  failures  can  be  modelled  by  assuming  complete  loss  of 
measurement. 
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Communication  failure  models  should  also  be  included  in  the  decentralized 
estimation  problem  \s  in  the  case  of  sensor  failure,  communication  failure  can 
also  be  modelled  as  either  hard  or  soft  A  hard  failure,  for  example,  could 
result  from  successful  jamming  or  failure  of  the  link  to  ooera'e.  Soft  failures 
can  be  handled  using  stansncal  measurement  noise  models. 
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